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(57) Abstract 



The present invention relates to polypeptides and proteins having nitroreductase activity. The invention also relates to DNA 
and genes encoding these nitroreductases, and to methods of obtaining such enzymes, DNA and genes. In a particularly preferred 
aspect, the nitroreductase enzymes demonstrate preferential catalytic conversion of the alkylating agent CB1954 into its highly cytotoxic 
4-hydroxylamine (4HX) derivative, this derivative demonstrating anticarcinoma properties. Accordingly, the catalytic activity of the 
nitroreductase enzymes of the present invention may be employed to achieve catalysis of CB1954 into its cytotoxic derivative in a 
site-directed manner, such as by Directed-Enzyme Prodrug Therapy (DEPT). 
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NITROREDUCTASE ENZYMES 

The present invention relates to polypeptides and proteins ; '".having 
nitroreductase activity, to DNA and genes encoding these nitroreductases 
and to methods of obtaining such enzymes, DNA and genes. 

A number of cancer therapies are based upon or exploit the conversion of 
a non-toxic prodrug into a toxic derivative. 

One example concerns the monofunctional alkylating agent CB1 954, which 
exhibits extreme toxicity towards the Walker 256 rat carcinoma as a result 
of the presence of a DT-diaphorase enzyme (DTD) which reduces the 4- 
nitro group of CB1954 to give a highly cytotoxic 4 : hydroxylamine (4HX) 
derivative. CB1 95.4 does not have the same effect on human carcinomas 
because human cells lack this enzyme but would be effective against 
human tumours if an enzyme such as DTD were externally supplied, e.g. 
in a Directed-Enzyme Prodrug Therapy (DEPT). The rat DTD, however, has 
a relatively poor specific activity for CB1954. The E.coli B nitroreductase 
enzyme (NfnB) was isolated as a more effective alternative and is the 
subject of EP-A-0540263. It exhibits a higher specific activity for CB1 954, 
compared with the rat enzyme and is, therefore, currently the preferred 
enzyme in anti-cancer DEPT strategies. 

Whilst the known £.co//enzyme receives widespread attention from cancer 
biologists seeking to develop gene based DEPT strategies, it has a number 
of drawbacks. These mostly relate to its activity against the preferred 
prodrug, CB1 954 - it has a relatively high K m and low K cat , and converts 
CB1 954 into equimotar amounts of a relatively innocuous 2-hydroxylamino 
derivative (2HX) in addition to the highly cytotoxic 4-hydroxylamino species 
(4HX). 



In relation to this specific prodrug, it is hence desired to provide 
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alternative to the known E.coli enzyme. 

Additionally, and more generally, analogues of CB1 954 and prodrugs other 
than CB1954 are known and further such precursors of potential toxic 
agents may become the focus of future therapies. In relation to all of these 
it is desired to provide further enzymes capable of use in converting 
prodrugs into drugs, e.g. for clinical uses. 

It is an object of the.present invention to provide nitroreductase enzymes, 
in particular nitroreductase enzymes for converting CB1 954 and analogues 
thereof into. drugs. It is a further object of the present invention to provide 
DNA and genes encoding nitroreductases, which DN A and genes in 
particular are incorporated into pharmaceutical compositions for prodrug 
therapies. 

The present invention is based upon the discovery, purification, gene 
sequencing and/or expression of nitroreductases in bacteria and other 
microorganisms with hitherto unknown properties in converting prodrugs 
such as CB1954 into toxic derivatives. These nitroreductases posses 
properties which alone or in combination offer potential improvements 
compared with the known enzymes in this technology. The nitroreductases 
of the invention may be divided into different families based upon such 
characteristics as activity, product spectrum and/or amino acid sequence, 
and each given nitroreductase may fall into more than one of these 
families. 

The present invention provides, in a first aspect, a nitroreductase enzyme, 
characterised in that it preferentially reduces CB1 954 to a product that is 
a cytotoxic 4-hydroxylamihe (4HX) derivative. 

The enzymes of this aspect of the present invention confer the advantage 
that the product they generate from CB1 954 contains a greater proportion 
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of the cytotoxic 4HX derivative then the rion-cytotoxic 2-hydroxylamino 
derivative. In preferred embodiments of the invention, the product is 
' substantially entirely the cytotoxic derivative. The. enzymes may. hence be- 
more efficient that those of the art as the enzymes of the invention 
produce more cytotoxic product for a given amount of pro-drug. 

The present invention further provides, in a second aspect, a nitroreductase 
enzyme, characterised in that it reduces a prodrug to a toxic derivative with, 
a K m of less 700 micromolar, wherein the prodrug is selected from CB1 954 
and analogues thereof or other bioreductive drugs (Denny, et a I, B.J. 
Cancer, 1996, 74; pp S32-S38) . The enzymes of the second aspect of the 
invention offer an advantage over the known E. coli- derived enzyme in that 
they have a lower K m (K m of E.coli NfnB for CB1954 is around 862. 
micromolar) and thus have a higher affinity for substrate. Twenty nitrogen ' 
mustard analogues Of CB1 954 are described by Friedlos et al (J Med Chem, 
1997,40,1270-1275). 

More preferably, the K m of the enzymes of the second aspect of the 
invention is less than 300 micromolar. . 

In a third aspect, the present invention provides a nitroreductase enzyme 
characterised in that it reduces a prodrug to a toxic derivative with a K cit 
of at least 8, wherein the prodrug is selected from CB1 954 and analogues 
thereof. 

The enzymes of this aspect of the invention offer an improvement over that 
of the art, specifically the E.coli enzyme, in that they have an improved K cai 
- i.e a higher value than lor E.coli NfnB indicating a higher turnover of 
substrate by the. enzyme. In preferred embodiments of this aspect, of the 
inveintion, the K cat of the enzymes is at least 10. 



n a fourth aspect of the invention, there is provided a nitroreductase 
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enzyme .characterised in that it reduces GB1954 to a toxic derivative, it 
reduces SN23862 to a toxic derivative, it can use. NADH and/or NADPH as 
electron donor and in that it shares no- more than 50% sequence identity 
with the E.coli NfnB sequence. Preferably, the sequence identity is about 
. 25% or less,. this sequence identity being measured using the MEG ALIGN, 
(registered trade mark) software. 

It has already been discussed how the known E.coli nitroreductase is well 
characterised and is fully sequenced. The nitroreductases of the fourth 
aspect thus represent a class of enzymes having nitroreductase activity, or 
being nitroreductase-iike, which nevertheless are so different in amino acid 
sequence from the E.coli enzyme as to represent a separate family of 
nitroreductases. 

This aspect of the invention thus advantageously provides a further class 
of nitroreductase enzymes for use e.g. in prodrug therapies. 

The invention still further provides, in a fifth aspect, a nitroreductase 
enzyme characterised in that it reduces CB1 954 or an analogue thereof to 
a toxic derivative, in that it shares at least 50% sequence identity with the 
rat DTD sequence and in that it does not contain a domain that is the same 
as or corresponds to amino acids 51 to 82 of the rat DTD sequence. 

Sequence identity is suitably measured in the same way as described above - 
in relation to the fourth aspect. 

To determine whether a given nitroreductase contains a domain that is the 
same as or corresponds to amino acids 51 to 82 of the rat DTD sequence, 
the amino acid sequence of the given nitroreductase and of the rat DTD 
sequence are aligned using a conventional sequence alignment program, 
such as MEGALIGN (registered trade mark) made by DNASTAR, Inc. 
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if the alignment program indicates that there are no amino acids in the 
given sequence that, following the algorhythm of the program, are held to 
correspond to those at positions 51-82 of the rat DTD sequence then it is 
concluded that the rat domain is lacking from the given sequence. 

5 

This aspect of the invention thus provides a further class of nitroreductase 
enzymes for conversion e.g. of prodrugs into drugs. A nitroreductase in 
this class may also be obtained by deleting amino acid residues that 
correspond to residues 51-82 of the rat DTD from a known mammalian 
10 enzyme. 

The nitroreductases of the invention may also be NADPH dependant. This 
property further distinguishes some enzymes of the invention from the 
known E.coli enzyme and the rat DTD. 

It has been found that enzymes having one or more of the properties 
described may be obtained from bacteria of the family Bacillus, in particular 
a Bacillus selected from B. amyloliquefaciens, B. subtilis, B. pumilis, B. 
lautus, B. thermoflavus, B. licheniformis and B. alkophiius. This finding is 
20 of surprise in that at least three nitroreductase enzymes have been found 

in some species, in particular B. subtilis, B. lautus and B. pumilis, and as 
nitroreductases having the advantageous properties of the invention have 
not hitherto been identified in these bacteria, the currently used 
nitroreductase being obtained from E.coli. 

25 

In specific embodiments of the invention described in more detail below, 
a nitroreductase has a sequence selected from SEQ ID Nos 2, 4, 6, 8, 10, 
12, 14, 16, 17, 18, 19, 20, 21, 23, 25, 27 and 29. 

30 It has further been found that nitroreductases according to the invention 

may fall into more than one aspects of the invention. It is hence preferred 
that a nitroreductase of the invention possesses the properties of at least 
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two aspects of the invention, and more preferably at least three aspects of 
the invention. . 



A specific embodiment of the invention is a nitroreductase of SEQ ID NO:2 
obtained from B; amylofiquefaciens this enzyme converts CD1 94 into 
substantially only the cytotoxic derivative, hence falling into the first aspect 
of the invention, but also has a K m that is improved compared to the E.coii 
enzyme, hence falling also into the second aspect of the invention. 

A further specific embodiment of the invention is a nitroreductase from 
B.subtifis, SEQ ID NO:9. This enzyme has a better K cat than the E.coii 
enzyme, its K cat being about 15 compared with about 6 for the E.coii 
enzyme, and hence falls into the third aspect of the invention . Additionally, 
this enzyme falls into the fourth aspect of the invention in that it reduces 
both CB1 954 and SN23862 but shares less than 30% sequence identity 
with the E.coii sequence. Another B.subtiiis enzyme, SEQ ID NO:1 1 is 
similarly in both the third and fourth aspects of the invention, having a K cat 
of about 1.5. ... 

From the examples set out below it will be apparent how the further- 
specific embodiments of the invention fall into at least two and even three, 
aspects of the invention. 

The enzymes of the invention are of use in enzyme directed prodrug 
therapy. Accordingly, it is preferred that they are provided in purified form. 

A sixth aspect of the invention provides a pharmaceutical composition 
comprising a nitroreductase enzyme according to any of the first to fifth . 
aspects of the invention in combination with a pharmaceutically acceptable 
carrier. - 



As mentioned above, the nitroreductase of the invention are of use in 
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therapies such as directed-enzyme prodrug therapies. In these therapies, 
it is required to deliver the nitroreductase to the target site. This delivery 
can be achieved by delivering the enzyme itself or by delivering a DMA or 
gene coding for the enzyme. 

5 

In an example of the enzyme of the invention in use, a pharmaceutical 
composition is designed for a directed-enzyme prodrug therapy, and 
comprises a pharmaceutical^ acceptable carrier and a compound for 
converting a prodrug into a drug, wherein a compound is composed of at 
1 o least a nitroreductase according to any of the first to fifth aspects of the 

invention conjugated to a targeting moiety. 

The targeting moiety can suitably comprise an antibody specific for a target 
cell. Alternatively, the targeting moiety is a moiety preferentially 
15 accumulated by or taken up by a target cell. 

A further example of delivery of the enzyme of the invention is achieved in 
a gene therapy-based approach for targeting cancer cells, as described in 
WO 95/12678. As described by Knox R.J. et al, the basis of this further 

20 prodrug therapy is delivery of a drug susceptibility gene into target, usually 

tumour or cancer, cells. The gene encodes a nitroreductase that catalyses 
the conversion of a prodrug into a cytotoxic derivative. The nitroreductase 
itself is not toxic and cytotoxicity used to treat the tumour cells arises after 
administration of a prodrug which is converted into the cytotoxic form. A 

25 bystander effect may be observed as cytotoxic drug may diffuse into, 

neighbouring cells. 

Thus, in this gene-based therapy, the nitroreductase is expressed inside a 
cell, in contrast to other delivery systems in which, for example, the 
30 enzyme itself is delivered accompanied by a targeting, moiety. 

Targeting of gene-based therapies may be achieved by providing a virus or 
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liposome with altered surface components so that the delivery vehicle is 
recognised by target cells. Typically, transcriptional elements are chosen 
so that the gene coding for the nitroreductase enzyme will be expressed. ip 
the target cells, and preferably substantially only in the target cells. A 
5 number of viral-based vectors are suitable for this delivery. Retro-viral 

based vectors typically infect replicating cells. Adenoviral vectors and 
lentiviral-vectors are also believed to be suitable. 

This delivery technology has been demonstrated by Bridgewater et al (Eur 
10 J Cancer 3ia, 236-2370, 1995). A recombinant retrovirus encoding a 

nitroreductase was used to infect mammalian cells, it being observed that 
infected cells expressing the nitroreductase were killed by application of 
CB1954. 

1 5 Accordingly, a further aspect of the invention provides the use of a DNA 

sequence coding for a nitroreductase of the invention in manufacture of a 
medicament for prodrug therapy. 

The medicament may take the form of a viral vector, comprising a DNA 
20 encoding the nitroreductase of the invention operatively coupled to a 

promoter for expression of the DNA. The medicament may take the form 
of a mini-gene comprising a DNA operatively linked to a promoter for 
expression of the DNA, the mini-gene being suitable for inclusion or 
incorporation into a targeting vehicle such as a microparticle. 

Thus, an embodiment of the invention provides a viral vector comprising 
a nucleotide sequence encoding a nitroreductase according to any of 
aspects 1 to 5 of the invention, which nitroreductase converts a prodrug 
into a cytotoxic drug, and also a kit comprising the viral vector and the 
30 prodrug, and also a method of treatment of tumours which comprises 

administering an effective amount of the viral vector together with an 
effective amount of the prodrug. 
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The. preparation and administration of these viral vectors may be 
" •■■ substantially as described in WO 95/12678, the contents of which is 
■ incorporated herein by reference. The present invention relates to 
providing nitroreductase enzymes and genes and DNA coding therefore. 
The uses of those enzymes and genes may be as set out in WO 95/1 2678. 

A nitroreductase can also be delivered by putting a gene of the invention 
into a bacteria that selectively colonises tumours, such as a clostridial 
(Lemmonet al. Gene Therapy, 1997, 4, 791-796) or Salmonella species. 

A further aspect of the invention provides an isolated DNA encoding a 
nitroreductase according to any of the first to fifth aspects of the invention. 
The DNAs of this further aspect of the invention, and also the DNAs 
incorporated into vectors of the invention, preferably comprise a sequence 
which is selected from SEQ ID NOs: 1, 3, 5, 7, 9, 11, 13, 15, 22/24, 26 
or 28, together with fragments, derivatives and analogs thereof retaining 
nitroreductase activity according to one of the first to fifth aspects of the 
invention. The fragments, derivatives and analogs are suitably selected 
. from sequences which retain at least 70% identity with the specific 
embodiments of the invention, or preferably at least 90% identity and most 
preferably at least 95% identity. 

The enzymes of the invention can also be obtained by purification from cell 
extracts and may also be obtained by recombinant expression of DNA. A 
25 still further aspect of the invention lies in a method of preparing a 

. nitroreductase enzyme, comprising expressing a gene in a bacterial cell, 
wherein the gene codes for a nitroreductase enzyme of the invention. 

In an example of the invention described below in more detail, the gene 
30 expressed is a Bacillus gene or is a gene obtained by substitution, deletion 

and/or addition of nucleotides in or to a Bacillus gene. 
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The invention also provides the use of a nitroreductase according to any of 
the aspects of the invention in manufacture of a medicament for anti- 
tumour therapy,. and the use of a compound comprising a nitroreductase 
according to any aspect of the invention conjugated to a targeting moiety 
in manufacture of a medicament for anti-tumour therapy. 

The invention is now illustrated by the following specific examples and in 
the accompanying sequence listing in which: 

SEQ ID NO: 2 is a nitroreductase from B.amyloliquefaciens (coded 
for by SEQ ID NO: 1) and designated "Bam YrwO"; 

. SEQ ID NO: 4 is a nitroreductase from B.subtilis (coded for by SEQ 
ID NO: 3) and designated "Bs YwrO"; 

SEQ ID NO: 6 is a nitroreductase from B.subtilis (coded for by SEQ 
ID NO: 5) and designated "YrkL"; 

SEQ ID NO: 8 is a nitroreductase from B.subtilis (coded for by SEQ 
ID NO: 7) and designated "YdeQ"; 

SEQ ID NO: 10 is a nitroreductase from B.subtilis (coded for by SEQ 
ID NO: 9) and designated "Ydgl"; 

SEQ ID NO: 1 2 is a nitroreductase from B.subtilis (coded for by SEQ 
ID NO: 1 1 ) and designated "YodC"; 

SEQ ID NO: 14 is a nitroreductase from E.coli (coded for by SEQ ID 
NO: 13) and designated "YabF" 

SEQ ID NO: 16 is a nitroreductase from E.coli (coded for by SEQ ID 
NO: 15) and designated "YheR"; 

SEQ ID NO: 1 7 is a nitroreductase from H. influenzae; 

SEQ ID NO: 1 8 is a nitroreductase from T.aquaticus; 

SEQ ID NO: 1 9 is a nitroreductase from Synechocystis sp PCC 6803; 

SEQ ID NO: 20 is a nitroreductase from A.fu/gidus; 

SEQ ID NO: 21 is a nitroreductase from A.fufgidus. 

SEQ ID NO: 23 is a nitroreductase from Campylobacter jejuni (coded 

for by SEQ ID NO: 22); 

SEQ ID NO: 25 is a nitroreductase from Porphyromonas gingivalis 
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(coded for by SEQ ID NO: 24); 

SEQ ID NO: 27 is a nitroreductase from Yersinia pestis (coded for by 

SEQ ID NO: 26); and 

SEQ ID NO: 29 is a nitroreductase from Helicobacter pylori (coded. 

5 . for by SEQ ID NO: 28). 

The invention is also illustrated by reference to the accompanying Tables 
1-4 and Figures 1 and 2, in which Figs 1 and 2 show sequence 
comparisons as set out in. more detail in Example 8. 

0 

Example 1 

A Nitroreductase Enzyme/Gene from Bacillus amyloliquefaciens 

Briefly, extracts of Bacillus amyloliquefaciens were shown to possess 
15 nitroreductase activity'. To purify this activity, crude cell extracts were 

subjected to ammonium sulphate, fractionation and anion exchange 
chromatography. The purified material was subject to N-terminal amino 
acid sequence analysis and the information obtained used to cloned the 
gene via a PCR-based strategy. Following determination of its nucleotide 
20 sequence the gene was overexpressed in E. coJi and the resultant 

recombinant protein purified and characterised see table 1 . 

This analysis showed that the enzyme had properties which were distinct 
• from that of E.coli NfnB. Thus the protein had a more favourable K m for 
25 CB1 954 (1 .5-fold lower than the E. coli B NfnB) and furthermore converted 

CB1 954 into the 4HX form alone. It also differed from the E. coli B NfnB 
in that the enzyme showed no activity against the prodrug SN23862. 

The isolated enzyme/gene represents a significant improvement over the 
30 E. coli NfnB enzyme with respect to its activity against the prodrug CB1 954 

ie., it produces only the 4HX derivative and has an improved K m for 
CB1954. 
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A comparison of the amino acid sequence of the isolated enzyme revealed 
that it shared a very low level of homology to the rat DTD (c. 25%), but 
exhibited high homology (70% sequence identity) with the predicted 
product of a gene that has been discovered in the Bacillus subtllis genome 
5 sequencing project, designated ywrO, On this. basis, we have designated, 

the cloned Bacillus amyloliquefaciens geneywrO, and its encoded enzyme 
YwrO. 

YwrO BAM - is a tetrameric flavoprotein (monomeric molecular mass 
10 approximately 22.5 kDa by SDS-PAGE, native molecular mass 

approximately 90 kDa by gel filtration). Although it shares sequence 
homology with rat DTD it differs in its enzymic properties in that it can use 
only NADPH as cofactor (K m 40 pM). In common with DTD it can reduce 
• CB1954 but not SN23862, reduction of CB1954 resulting in formation of 
the 4HX product only (K m 617>M, k cat 8.2). It shows a high affinity for the 
quinone menadione (K m 3.4//M) and has azoreductase and flavin reductase, 
activity <K m for FMN 53 jjM, K m for FAD 209 >M). 

. in more detail N-terminal amino acid sequencing of the purified Bacillus 
20 amyloliquefaciens nitroreductase enzyme resulted in the following. 

. sequence,Met-Ly S -Val-Leu-Val-Leu-Ala-Val-His-Pro-Asp-lv1et-Glu-Asn-Ser- 

Ala-Val-Asn. When this sequence was used to search available protein 
databases strong homology was noted with the predicted amino acid 
sequence of a hypothetical protein, YrkL, identified in the Bacillus subtilis.. 
25 genome sequencing project. Significant homology was also evident with 

. two proteins, YabF and YheR, identified during the course of the 
determination of the Escherichia coli genome. These three hypothetical 
proteins shared weak homology with a number of mammalian quinone 
reductases and NAD(P)H-oxidoreductases, such as the rat DTD. 

In view of this observation, a strategy was formulated whereby sequence 
homology between the identified bacterial proteins, together with the 



15 



30 
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determined N-terminal amino acid sequence of the discovered Bacillus 
amyloliquefaciens enzyme, was used to ampiify a region of the desired 
encoding gene from the Bacillus amyloliquefaciens genome. The one primer 
utilised in PCR was a degenerate oligonucleotide sequence which 
5 corresponded to a DNA sequence capable of coding for the N-terminal 

octa-peptide Val-His-Pro-Asp-Met-Glu-Asn. It was composed of the 
following nucleotides, 5'-GTNCAYCCNGATATGGARAA-3', where Y 
indicates the presence of a T or C, R indicates the presence of A or G, and 
N indicates the presence of either T, C, G or A. The second primer was 
10 based on the hypothetical sequence His-Gly-Trp-Ala-Tyr-Gly which was 

found to be entirely conserved between the hypothetical bacterial proteins 
YrkL (Bacillus subtilis) and YabF (E.coJi), and partially conserved in YheR 
(£. coli) . The degenerate oligonucleotide mixture synthesised corresponded 
to the antisense DNA coding strand, viz., 5'-CCRTANGCCCANCCRTG-3' . 

15 

E.coli YheR (90-95) Arg Gly Phe Ala Ser Gly 

E.coli YabF (84-89) His Gly Trp Ala Tyr Gly 

B. subtilis YrkL (85-90) His Gly Trp Ala Tyr Gly 

20 The two primers were employed in PCR using chromosomal DNA isolated 

from Bacillus amyloliquefaciens and an amplified DNA fragment of the 
expected size (approximately 230 bp) obtained. This was cloned into 
plasmid pCR2.1TOPO (Invitrogen) and its nucleotide sequence determined. 
Translation of the sequence obtained demonstrated the presence of an 

25 open reading frame which encoded a polypeptide which shared 66% 

sequence similarity with YrkL. 

To obtain the entire structural gene, an approach was employed based on 
inverse PCR. In essence, B. amyloliquefaciens DNA was cleaved with the 
30 restriction enzyme Styl and the fragments generated circularised through 

their subsequent incubation with DNA ligase. The ligated DNA was then 
used as the template for a PCR employing two divergent primers based on 
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the sequenced 220 bp fragment. These were BamNTRI 1 (5'- 
GCTTATTG ACCGCTG AG-3' ) and BamNTRI 4 (5'-GTACAGTGCGCCTCCGC- 
3 ). A 2.9 kb fragment was generated, cloned into pCR2.1TOPO. .. 
(Invitrogen) and the sequence of the insert determined. This allowed the 

5 identification of the nucleotide sequence of the remaining parts of the B. 

amyloliquefaciens gene. Using this information, a contiguous copy of the 
entire structural gene was amplified from the B. amyloliquefaciens 
chromosome using primers which encompassed the translations! start 
codon (5'-GGTGTGATACATATGAAAGTATTG-3'J and resided 3' to the 
10 transnational stop codon (5'-CGGGGATTCGAATTCTTTCTCAGG-3') . The 

primer at the 5'-end of the gene was designed such the sequence 
immediately 5' to the ATG start codon became CAT. This change created 
an Nde\ restriction site (CAT ATG), thereby allowing the cloning of the gene 
into the equivalent site of the expression vector pMTL10T5. This 

15 manipulation facilitated the subsequent overexpression of the gene, as 

insertion of the gene at this point positions the start codon at an optimum 
distance from the vector borne ribosome binding site. 

The strategy employed to clone the BM YwrO gene could be similarly 
20 employed to clone further genes encoding novel nitroreductases. This 

would involve purifying the desired enzyme activity from a cell lysate, and 
then determining the N-terminal sequence. The data obtained could then 
be used to design an oligonucleotide primer corresponding to the sense 
strand of the'DNA encoding part or all of the determined amino acid 
25 sequence. This primer could then be used, in conjunction with a second 

primer, to amplify part of the gene encoding the nitroreductase from the 
chromosome of the bacterial host using PCR. The second primer would 
correspond to the antisense strand of an internal portion of the targeted 
gene. Its design would be based on regions of homology which are 
30 conserved amongst the type of nitroreductase family that is sought. Thus, 

in the case of the DTD-like family, the oligonucleotide would, for example 
be based on the conserved motif His-Gly-Trp-Ala-Tyr-Gly (ie., amino acid 
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residues 85-90 in the BS YrkL protein) . In the case of the NfnB-like family, 
the oiigonucleotdie could be based on the motif Glu-Arg-Tyr-Val-Pro-Val- 
Met (ie.. amino acid residues 170-176 in the BS YodC protein). 

Such amplified fragments could then be cloned and sequenced, and new 
primers designed based on this sequence to isolate the flanking regions of 
the gene by PCR. Once these have been cloned and sequenced, the entire, 
contiguous structural gene may be amplified using primers which extend 
beyond the 5' and 3' end of the translational start and stop codons. 

Cloning of genes encoding novel nitroreductases may also be achieved 
without recourse to N-terminal sequencing of the enzyme, or even its 
purification. This would involve basing the sequence of both of the 
oligonucleotides used in the initial PCR reaction on amino acid sequence 
motifs conserved amongst the two identified nitroreductase families. 
Thus, in the case of the NfnB-like family, a sense primer (eg., 5'- 
ATTTCTAAAGAAGAGCTGACGGAA-3') based on the motif lle-Ser-Lys-Glu- 
Glu-Leul-Thr-Glu (ie., amino acid residues 13 to 20 of BS YodC) could be 
employed with the an antisense primer (eg., 5'- 
CATTACCGGTACATAGCGTTC-3') based on the sequence motif Glu-Arg- 
Tyr-Val-Pro-Val-Met He'.'., amino acid residues 170 to 176). In the case of 
the DTD-family a sense primer (eg., 5'-CATCCGGATATGGAAAAT-3') 
based on the motif His-Pro-Asp-Met-Glu-Asn (ie., amino acid residues to 9 
to 14 of BM YwrO) could be employed with the an antisense primer (eg., 
5'-TCCATATGCCCATCCATA-3') based on the sequence motif Tyr-Gly- 
Trp-Ala-Tyr-Gly (ie., amino acid residues 85 to 90). Once amplified, the 
rest of the gene could be isolated using the same procedure as outlined 
above. 

Example 2 



Bacillus subtilis Nitroreductases 
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As indicated above in Example 1 , comparative analysis of the B. subtilis 
genome sequence with the amino acid sequence of the isolated. B. 
' amyloliquefaciens enzyme demonstrated the existence of an enzyme ., 
. (YwrO) which shared 70% sequence identity. Unexpectedly, B.subtilis was 
found to possess two homologues, YrkL and YdeQ, which share 54% and 
51% sequence homology, respectively, with the B. amyloliquefaciens 
enzyme. All' three enzymes share no homology with the E.coli B NfnB. 
■ They do, however, exhibit weak similarity (c. 25%) to the rat DT- 
Diaphorase (DTD), Whilst. these proteins share a low level of sequence 
similarity to DTD, and other mammalian equivalents, they are 
characteristically smaller. This is because of the absence of an extensive 
internal protein domain at the N-terminus of the protein. Thus, the 
functional equivalent domain of the rat DTD between amino acid residues 
51 to 82, are missing from the BM YwrO protein. In addition, the rat DTD 
has an extra COOH-terminal domain. These bacterial enzymes are thus 
distinct from their mammalian equivalents. 

A further analysis of the B.subtilis genome, demonstrated that - two 
homologues of the E.coli .'NfnB gene were present. Their encoded proteins 
(Ydgl and YodC) share. a barely detectable level of sequence conservation 
with EC NfnB, of around 20% sequence identity. 

Bacillus subtilis was thus found to carry at least 5 different enzymes with 
'. nitroreductase activity. These are split into two families, thus;r 

DTD-like - 3 members:- YwrO, YrkL, YdeQ 
NfnB-like - 2 members:- Ydgl, YodC 

Example 3 



Recombinant Production of Nitroreductases from Bacillus subtilis 
The DNA encoding all 5. B.subtilis nitroreductase enzymes were 
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from genomic DNA using PGR and the resultant genes, following 
authentication by nucleotide sequencing, subcloned into a propriety 
CAMR expression vector <pMTL1015). The expression clones generated 
have been used to overproduce each of the 5 proteins and the enzymic 
activity of each assessed in crude lysates. This analysis has demonstrated 
that whilst the B.subtilis YwrO shares similar properties to the B. 
amyloliquefaciens homologue (is., converts CB1 954 to the 4HX derivative 
alone, but is inactive against SN23862), YrkL and YdeQ have no activity 
against either of the two prodrugs tested (CB1 954 or SN23862) but they 
may be active against other prodrugs. 

Despite the extremely limited sequence similarity to EC NfnB, Ydgl and 
YodC are active against both CB1954 and SN23862. They do, however, 
produce both the 2HX and 4HX derivatives of CB1954. Their 
characterisation has shown that they turn over CB1 954 at higher rates than 
EC NfnB (YodC k cat 58, Ydgl k cat 30.3 cf 6 for NfnB). Both show a high 
affinity for menadione and flavins, but they differ in that whereas Ydgl uses 
both NADH and NADPH, YodC shows a preference for the latter. The 
native molecular mass of YodC (approximately 90kDa> indicates that it is 
tetrameric (molecular mass estimated from amino acid sequence and by 
SDS-PAGE being approximately 22 kDa) whereas Ydgl appears to be a 
dimer in the native state. (molecular mass by gel filtration approximately 49 
kDa). 

\ These finding are further illustrated in Table 2. 
Example 4 

Bacillus lautus & Bacillus pumilis nitroreductases 

From 103 soil sample isolates tested, two strains {Bacillus pumilis CP044 
and Bacillus, lautus CP060) had been previously chosen as possessing 
extracts which showed the most rapid reduction of both CB1 954 and 
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SN23362. Purification experiments demonstrated that the activity in both 
extracts was distributed across three distinct peaks. The presence of more 
than one enzyme activity is consistent with our discovery of multiple forms 
of proteins in Bacillus able to turnover prodrugs. Eventual purification of the 
three enzymes of B. pumilis CP044 revealed that no one candidate 
exhibited properties which were an improvement on the E.coli NfnB 
enzyme. In. contrast, the proteins in peak 1 and peak 3 of the B.lautus 
CP060 were determined to offer advantage over NfnB. 

Thus, whilst the enzyme in peak 1 did not produce the required 4HX 
derivative of CB1954, it exhibited a 4-fold lower Km with the prodrug 
SN23862. The enzyme of peak 3 was, however, deemed to be of greatest 
value as it converted CB1 954 solely into the 4HX derivative and had a Km 
approximately 4-fold lower than NfnB. Furthermore, it also had activity 
against SN23862. In this respect it shares the properties of both the 
Bacillus DTD-like family (ie., it produces only the 4HX derivative) and the 
NfnB-like family (ie., it is active against SN23862) - these findings are 
illustrated in Table 3. 

Example 5 

N-terminai Sequencing of B. lautus Nitroreductase 

Electrophoretic separation of the peak 3 demonstrated that 4 protein bands 
were present which could account for the observed prodrug activity. All 
four were subjected to N-terrninai amino acid sequencing and the activity 
localised to the fourth protein band from which the nitroreductase may be 
purified. 

Example 6 



Detection of Nitroreductase Activity in Thermophile Extracts 

As an alternative source novel enzymes, a preliminary screen of CAMRs 
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thermophile collection was undertaken. Enzymes from this source may 
have the advantage of greater stability, and therefore longevity of action. 
Strains were selected on the basis either of sensitivity to CB1 954, or those, 
which are resistant but which impart a yellow/golden coloration to agar 
containing prodrug. 

Two of these strains (B. thermoflavus and B. iicheniformis) generated the 
cytotoxic 4HX form and were selected for further study. 

Example 7 

Identification Of Further Nitroreductase Enzymes 

Having identified the two families of nitroreductase in Bacillus, a search 
was undertaken of both finished and unfinished genomes for homologues, 
using YwrO and YodC/NfnB. On the basis of this search homologues of 
YwrO were identified in the genomes of Yersinia pestis and Porphyromonas 
gingivalis, and homologues of NfnB in the genomes of Pyrococcus furiosus, 
Haemophilus influenza, Synechocystis PCC 6803, Campylobacter jejuni, 
Archaeglobus, Helicobacter pylori, Heiiocbacter fulgidus and Thermus 
aquaticus. 

In addition to the above, two E.coli genes were found to be homologues of 
rat DTD and YwrO, and were designated Yher and YabF. They were 
discovered to share the characteristic of YwrO in that they lack the internal 
protein domain found in the rat DTD enzyme and functional mammalian 
homologues. 

(i) P. gingivalis YwrO homologue 



P. gingivalis YwrO homologue is a dimeric flavoprotein with native molecular 
mass estimated by gel filtration at 40 kDa. Although it shares sequence 
homology with DTD and forms only the 4HX reduction product of CB1 954 
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(K m 1 lOOfjM.t k caT 3.2), it differs from DTD in that it is active with SN23362 
and it can only use NADH as cofactor (cf DTD which can use either NADH 
or NADPH and is inactive with SN23862). It can reduce 'azodyes but it is 
inactive with menadione or flavins. 

(ii) C. jejuni NfnB homologue 

C. jejuni NfnB homologue produces only the 4HX reduction product of 
CB1954 (K m 143 pM, k cat 11.2) using NADPH as cofactor and it is also 
active with SN23862. It can use the quinone menadione as substrate as 
well as azodyes and the flavins FMN and FAD. 

(iii) Archaeogiobus fulgidus NfnB homologue 

Archaeoglobus fulgidus NfnB homologue is a dimeric fiavoprotein of 42 kDa 
. native molecular mass, producing the 4HX derivative of CB1 954 only (K m 
690 ijM, k cal 56.2) using NADPH as cofactor. It is also active with 
SN23862 and menadione (K m 9//M), but does not decolourise azodyes and 
has only weak flavin reductase activity. 

(iv> H. influenzae and. H. pylori NfnB homologues 

Both these enzymes are dimeric flavoproteins and form the 4HX reduction 
product of CB1954 using NADPH in preference to NADH, but have no 
activity with azodyes. The former also lacks activity with the quinone 
menadione and flavins FMN or. FAD. Both however have weak activity 
with SN23862 and may be active with other prodrugs. 

(v) Y.pestis nfnB homologue and Synechocystis YwrO homologue 

Both these proteins reduce CB1954 but produce only the relatively non- 
toxic 2HX derivative using NADPH as cofactor. They do however show 
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activity- with. SN23862 and the former can also reduce azodyes. 
Example 8 

Comparison of Nitroreductase Sequences 

We compared the amino acid sequences of nitoreductases according to the 
invention with each other and with known rat, human and E.coli 
sequences, and the results are illustrated in Figures 1 and 2. In Figure 1,, 
rat, mouse and two human sequences make up the first four lanes for 
comparison purposes. It is evident that nitroreductases of the invention are 
lacking a sequence from positions 51-82 of the rat sequence. 

In Figure 2; sequences of nitroreductases of the invention are compared 
with the known E.coli sequence, which is designated nfmB in the second- 
to-last lane. 

The invention thus provides nitroreductase enzymes, DNA and genes 
therefor and methods of obtaining such enzymes and of using the enzymes 
and DNA coding therefor in clinical applications. 
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Table 3: Fractionation of nitroreductase activity in ceil extracts of Bacillus lautus and Bacillus pumilis 
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Table 4: Characteristics of nitroreductase activity of thermophiles identified as being sensitive to CB1954 
[Identified as Bacillus ihermoflavus a , Bacillus licheniformis *>, Bacillus licheniformis c, Bacillus alkophilus d ] 
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Table 4: Characteristics of nitroreductase activity of thermophiies identified as being sensitive to CB1954 
[Identified as Bacillus (her mqflavus », Bacillus licheniformis b, Bacillus licheniformis c Bacillus alkophilus <*] 
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CLAIMS 

1. A nitroreductase characterised in that it preferentially reduces 
CB1954 to a cytotoxic 4-hydroxylamine (4HX) derivative instead of a non- 

5 cytotoxic 2-hydroxylamine derivative. 

2. A nitroreductase according to Claim 1 further characterised in that 
it reduces" CB1 954 to the 4HX derivative with a K m of less than 700 
micromoiar. 

10 

3. A nitroreductase according to Claim 1 or 2 further characterised in 
that it is NADPH dependant. 

4. A nitroreductase according to any of Claims 1 to 3, further 
15 characterised in that it reduces CB1954 to a cytotoxic 4-hydroxyiamine 

(4HX) derivative substantially without producing the non-cytotoxic 2- 
hydroxylamine derivative. 

5. A nitroreductase according to any of Claims 1 to 4 which reduces 
20 the prodrug to the toxic derivative with a Kcat of at least 8. 

6. A nitroreductase according to any of Claims 1 to 5, which reduces 
CB1954 or an analogue thereof to a toxic derivative, shares at least 50% 
sequence identity with the rat DTD sequence and does not contain a 

25 domain that is the same as or corresponds to amino acids 51 to 82 of the 

rat DTD sequence. 

7 . A nitroreductase characterised in that it reduces a prodrug to a toxic 
derivative with a K m of less 700 micromoiar, wherein the prodrug is 

30 selected from CB1954 and analogues thereof. 

8. A nitroreductase according to Claim 7 which reduces the prodrug to 
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the toxic derivative with a K m of less 300 micromolar. 

9. A nitroreductase according to Claim 7 or 8 which reduces the 
prodrug to the toxic derivative with a Kcaf of at least 8. 

5 

1 0. A nitroreductase according to Claim 9 which reduces the prodrug to 
the toxic derivative with a Kcaf of at least 1 0. 

11. A nitroreductase according to any of Claims 7 to 10, further 
10 characterised in that it reduces CB1954 to a toxic derivative, it reduces 

SN23862 to a toxic derivative, it can use both NADH and NADPH as 
electron donor and in that it shares no more than 30% sequence identity 
with the E.cofi NfnB sequence. 



15 



20 



12. A nitroreductase according to any of Claims; 7 to 1 1 further 
characterised in that it shares at least 50% sequence identity with the rat 
DTD sequence and in that it does not contain a domain that is the same as 
or corresponds to amino acids 51 to 82 of the rat DTD sequence. 

1 3. A nitroreductase characterised in that it reduces a prodrug to a toxic 
derivative with a Kcaf of at least 8. 



1 4. A nitroreductase according to Claim 1 3, further characterised in that 
it reduces CB1954 to a toxic derivative, it reduces SN23862 to a toxic 

25 derivative, it can use both NADH and NADPH as electron donor and in that 

it shares no more than 30% sequence identity with the E.coli NfnB 
sequence. 

15. A nitroreductase according to Claim 1 3 or 1 4, further characterised 
30 in that it reduces CB1 954 or an analogue thereof to a toxic derivative, in 

that it shares at least 50% sequence identity with the rat DTD sequence 
and in that it does not contain a domain that is the same as or corresponds 
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to amino acids 51 to 82 of the rat DTD sequence. 

16. A nitroreductase characterised in that it reduces CB1 954 to a toxic 
derivative, it reduces SN23862 to a toxic derivative, it can use both NADH 

5 and NADPH as electron donor and in that it shares no more than 30% 

sequence identity with the E.coli NfnB sequence. 

17. A nitroreductase according to Claim 16, wherein the sequence 
identity is about 25% or less. 

10 

18. A nitroreductase characterised in that it reduces CB1954 or an 
analogue thereof to a toxic derivative, in that it shares at least 50% 
sequence identity with the rat DTD sequence and in that it does not 
contain a domain that is the same as or corresponds to amino acids 51 to 

15 82 of the rat DTD sequence. 

1 9. Use of a DNA sequence coding for a nitroreductase according to any 
preceding Claim in manufacture of a medicament for prodrug therapy. 

20 20. A viral vector, comprising 

(a) a DNA encoding nitroreductase according to any of 
Claims 1 to 18 operatively coupled to 

(b) a promoter for expression of the DNA. 

25 21. A mini-gene comprising 

(a) a DNA encoding nitroreductase according to any of 
Claims 1 to 18 operatively coupled to 

(b) a promoter for expression of the DNA. 

30 22. A pharmaceutical composition comprising a nitroreductaseaccording 

to any of Claims 1 to 1 8 in combination with a pharmaceutical^ acceptable 
carrier. 
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23. A pharmaceutical composition for use in a directed-enzyme prodrug 
therapy, comprising a pharmaceuticaily acceptable carrier and a compound 
for converting a prodrug into a drug, wherein a compound comprises a 
nitroreductase according to any of Claims 1 to 1 8 conjugated to a targeting 

5 moiety. 

24. A pharmaceutical composition according to Claim 23 wherein the 
targeting moiety comprises an antibody specific for a target cell. 

10 25. A pharmaceutical composition according to Claim 23 wherein the 

targeting moiety is a moiety preferentially accumulated by or taking up by 
a target cell. 

26. A method of preparing a nitroreductase, comprising expressing a 
15 gene in a bacterial cell, wherein the gene codes for a nitroreductase 

according to any of Claims 1 to 18. 

27. Use of a nitroreductase according to any of Claims 1-18 in 
manufacture of a medicament for anti-tumour therapy. 



20 



28. Use of a compound comprising a nitroreductase according to any of 
Claims 1 to 1 8 conjugated to a targeting moiety in manufacture of a 
medicament for anti-tumour therapy. 
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Fio. 1 



DTD-Li k 5 Proteins 

• MAnrP\H-cuinons oxidoreductase -1 ■ (brown rax): 
The aligned proteins are: NQOWat NAD(P)H NQQ1 hurnan , . NAD(P}H-quinona 

NQOl.mouse, NAD(P)H-quinone oxidoreductas- J .rn ^ 2 {numan}; Yersinia 

oxidoreductase 1 (human); NQ02_human, NADj?)H^non- o^^^ ^ vwr0$ufaTfli ywr0 
un-named homologue (/ers/n/a pasw); yneR_=coii. . yn- ». vrkLsubT || f vrkl (fi»c/7/u* joAd7/s): 

ydeQsubTii. ydeQ (BaciiFus subtiiisi; Porpn>ng. un-named homoiogu- U 
and; yabr_Ecof!, yabF {Escherichia coif) 
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MfflKTNQFl 



vdgI-33 
yodC-3fl 
Synachocystis 

-£ab_Qatcl 
sfaS 



ydgl-Bs 
yodC-Ba 
Syaachocystia 

Taq 

af sJd__ azir cl 



ydgZ-Bs 
yodC-3s 
Synaciocystis 
Tacj 

Sal_typlii=i 
sfaS 



ydgl-Bs 
yodC -Ba 
Synacaocysti; 
Tag 

cfni>_ent2l 
3aam„inf 




Nfn3-Like Proreins . 

The aligned proteins are: yd 9 l-BS. ydgl Oacitlus sutrilisV. yodC-i^ yodC [Bacillus wMfel; 
Synechocys^. or 9 A ,Sy„*c»ccyr* PCC S803); Tap., NOX THE7H J'™'. . - 

Sal TYPhim. nine (Salmonella ryp^ni/ml; r>fnb_enici. ntn3 [Etrterooacrer cloacae). mr.B. nrnS 
{Escherichia coli B). and: Haem_inf. YC78_HA£!N ^Haemophilus influenzae). 
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SEQUENCE LISTING 

<1I0> Microbiological Research Authority 

<120> Nitroreductase Enzymes 

<=130> gws/2122S -seq 

<140> 
<141> 

<160> 27 

<17G> Patentln Ver. 2.1 

«c210> 1 
<211> 525 
<212> DNA 

<213> Bacillus antyloliouef aciens 

<220> 

<221> CDS 

<222> (1) . . (525) 

<400> 1 

gtg aaa at a ttg gta tta gcg gtt cac cct gac atg gag aac tea gcg 

M»t Lvs Val Leu Val Leu Ala Val His Pro Asp Met Glu Asn Ser Ala 

x " 5 . io 15 ; 

gtc aat aag gca tgg gca gaa gaa tta aaa aaa cat gat gaa etc acg 
Val Asn Lys Ala Trp Ala Glu Glu Leu Lys Lys His Asp Glu Leu Thr 
20 25 30 



ccg ctg tat tgg tac agt gcg cct ccg ctt tta aaa aca tgg atg gat 

i>ra Leu Tyr Trp Tyr Ser Ala Pro Pro Leu Leu Lys Thr Trp Met Asp 

65 * 70 75 80 

cat gtg ctg teg tac ggc tgg gec tac ggc tec aaa gga aag gcg ctg 

His Val Leu Ser Tyr Gly Trp Ala Tyr Gly Ser Lys Gly Lys Ala Leu 

85 SO 95 



gca tac cag gca gga ggg tea aac cac ttt aca ttg age gag ctg tta 

Ala Tyr Gin Ala Gly Glv Ser Asn His Phe Thr Leu Ser Glu Leu Leu 
115 " * 120 125 

agg ccg ttt cag gca atg get aat ttt aca ggt atg acc tat ttg ccg 

Arg Pro Phe Gin Ala Met Ala Asn Phe Thr Gly Met Thr Tyr Leu Pro 
130 135 140 



48 



96 



gtc cgt gag ctt tat aaa gaa tat ccg gac ggg caa ate gat gcg gaa 144 
Val Arg Glu Leu Tyr Lys Glu Tyr Pro Asp Gly Gin lie Asp Ala Glu 
35 40 45 

aag gaa cct cag ctg tgt gaa cag tat gac egg ate gta ttt caa ttt 192 
Lys Glu Arg Gin Leu Cys Glu Gin Tyr Asp Arg lie Val. Phe Gin Phe 
50 55 60 



240 



288 



cat ggc aaa gaa ttg atg ctg get gtt tec gta ggt gec gga gag gat 3 36 
His Gly Lys Glu Leu Met Leu Ala Val Ser Val Gly Ala Gly Glu Asp 
100 105 110 



384 



432 



get ttc gcg ctg tac ggt gta aat ggg gcg gat gcg acg gat att cat 4 80 

Ala Phe Ala Leu Tyr Gly Val Asn Gly Ala Asp Ala Thr Asp lie His 

145 150 155 160 

gac aat gec aaa cgt ctg get get tac ata aag aaa tea ttt taa 525 
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Aso Asn Ala Lvs Arg Leu Ala Ala Tyr lie Lys Lys Ser Phe 

165 170 175 

<210> 2 
<211> 175 
<212> ?RT 

<213> Bacillus amy lo 1 i que f ac i ens 
<400> 2 

Val Lvs Val Leu Val Leu Ala Val His Pro Asp Met: Glu Asn Ser Ala ■ 
1 5 10 15 

Val Asn Lys Ala Trp Ala Glu Glu Leu Lys Lys His Asp Glu Leu Thr 
20- 25' 30 

Val Arg Glu Leu Tyr Lys Glu 'Tyr Pro Asp Gly Gin lie Asp Ala Glu 
35 40 45 

Lys Glu Arg Gin Leu Cys Glu Gin Tyr Asp Arg lie Val Phe Gin Phe 
50 55 60 

Pro Leu Tvr Trp Tvr Ser Ala Pro Pro Leu Leu Lys Thr Trp Met Asp 
65 70 75 80 

His Val Leu Ser Tvr Gly Trp Ala Tyr Gly Ser Lys Gly Lys Ala Leu 
85 90 95 

His Gly Lys Glu Leu Met Leu Ala Val Ser Val Gly Ala Gly Glu Asp 
100 105 110 

Ala Tyr Gin Ala Glv Gly Ser Asn His Phe Thr Leu Ser Glu Leu Leu 
115 ' ' 120 125 

Arg Pro Phe Gin Ala Met Ala Asn Phe Thr Gly Met Thr Tyr Leu Pro 
130 135 140 

Ala Phe Ala Leu Tvr Gly Val Asn Gly Ala Asp Ala Thr Asp He His 
145 • * 150 155 160 

Asp Asn Ala Lys Arg Leu Ala Ala Tyr He Lys Lys Ser Phe 

165 170 175 

<210> 3 
<211> 526 
<212> DNA 

<213> Bacillus subtilis 

<220> 

<221> CDS 

<222> (1) . . (528) 

<400> 3 

atg aaa ata ttg gtt ttg gca gtg cat cct cat atg gag acc tea gtt 48 

Met Lvs He Leu Val Leu Ala Val His Pro His Met Glu Thr Ser Val 

1 5 10 15 

att aat aag gcg tgg get gag gaa ttg agt aaa cat gac aat ate aca 96 
Val Asn Lvs Ala Tro Ala Glu Glu Leu Ser Lys His Asp Asn He Thr 
20 " 25 30 



gta egg gat ctt tat aag gaa tac ccg gat gaa gcg ata gat gtt gcg 

Val Arg Asp Leu Tvr Lys Glu Tyr Pro Asp Glu Ala He Asp Val Ala 
35 " 40 45 

aag gaa cag cag ctg tgc gag gaa tac gat egg att gtc ttt caa tte 



144 



192 
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Lvs Glu Gin- Gin Leu Cvs Glu Glu Tyr Asp Arg lie Val Phe Gin Phe 
50 55 . 60 

ccg eta tat tgg tac age tct ccg : ccg etc ttg aaa aaa tgg cag gat 240 
Pro Leu Tvr Trp Tvr Ser Ser Pro Pro Leu Leu Lys Lys Trp Gin Asp 
55 ~ 70 75 SC 

ctt gtg ctg act tat ggc tgg get ttt ggt tea gaa gga aat gee ttg 28 8 
Leu Val Leu Thr Tyr Glv Trr> Ala Phe Gly Ser Glu Glv Asn Ala Leu 
85 90 " 95 

cat ggc aag gag ctg atg ctg get gta tea aca ggg age gaa gca gaa 336 
His Glv Lvs Glu Leu Met Leu Ala Val Ser Thr Gly Ser Glu Ala Glu 
100 105 110 

aaa tat caa gcg ggc gga gca aat cat tac teg ate agt gag eta ttg 3 84 
Lys Tyr Gin Ala Gly Gly Ala Asn His Tyr Ser lie Ser Glu Leu Leu 
115 ' 120 125 

aaa cca ttt cag gec acg agt aat ctg ate ggc atg aag tat ctg cct 432 
Lys Pro Phe Gin Ala Thr Ser Asn Leu lie Glv Met Lys Tvr Leu Pro 
130 135 140 

cca tat gtg ttc tat ggc gtg aat tat gca get gca gag gat att tct 480 
Pro Tvr Val Phe Tyr Glv Val Asn Tvr Ala Ala Ala Glu Asp lie Ser 
145 " 150. 155 160 

cac agt gca aaa egg tta gee gaa tac ate cag cag cct ttt gtt taa 52 8 
His Ser Ala Lys Arg Leu Ala Glu Tyr lie Gin Gin Pro Phe Val . 

165 170 175 

<210> 4 
<211> 176 
<212> PRT 

<213> Bacillus subtilis 
<400> 4 

Met Lys lie Leu Val Leu Ala Val His Pro His Met Glu Thr Ser Val 
1 5 10 - . 15 

Val Asn Lys Ala Trp Ala Glu Glu Leu Ser Lys His Asp Asn lie Thr 

20 .25 30 

Val Arg Asp Leu Tyr Lys Glu Tyr Pro Asp Glu Ala lie Asp Val Ala 

35 ....... 40 45 

Lvs Glu Gin Gin Leu Cvs Glu Glu Tyr Asp Arg lie Val Phe Gin Phe 
50 55 60 

Pro Leu Tyr Trp Tyr Ser Ser Pro Pro Leu Leu Lys Lys Trp Gin Asp 
65 "70 75 80 

Leu Val Leu Thr Tvr Gly Trp Ala Phe Gly Ser Glu Gly Asn Ala Leu 
85 "* " 90 95 

His Gly Lvs Glu Leu Met Leu Ala Val Ser Thr Gly Ser Glu Ala Glu 
100 105 110 

Lys Tyr Gin Ala Gly Gly Ala Asn His Tyr Ser lie Ser Glu Leu Leu 
115 * 120 125 

Lys Pro Phe Gin Ala Thr Ser Asn Leu lie Gly Met Lys Tyr Leu Pro 
130 135 140 

Pro Tyr Val Phe Tyr Gly Val Asn Tyr Ala Ala Ala Glu Asp lie Ser 
145 150 155 160 
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His Ser Ala Lys Arg Leu Ala Glu Tyr lie Gin Gin Pro Phe Val 
165 .-. 170 175. • 

<210> 5 
<211> 525 
<212> DMA 

<213> Bacillus subtilis 

<220> 

<221> CDS 

<222> (1) . . (525) 

<400> 5' " ' 

atg aaa aca tta gtt ate gtt ata cat cct . aat ttg gaa acg tct gtt 48 
Met Lys Thr Leu Val lie Val lie His Pro Asn Leu Glu Thr Ser Val 
1-5 .10 15 

gtc aac aaa acc tgg atg aat cgt tea aag caa gag aaa gac att acg 96 
Val Asn Lys Thr Trp Met Asn Arg Leu Lys Gin Glu Lys Asp lie Thr 
20 25 30 • 



gtt cat gac ctg tac ggt gaa tac cct aat.ttt ate att gat gta gaa 
Val His Asd Leu • Tyr Gly Glu Tyr Pro Asn Phe lie. lie Asp . Val Glu 
35 * 40 . 45 



cca atg tat tgg tac age agt ccc gcg tta etc aaa caa tgg gaa gat 
Pro Met Tvr Trx> Tvr Ser Ser Pro Ala Leu Leu Lys Gin Trp Glu Asp 
• 65 70 75 30 



gat ' tat caa gca ggc gga gaa tat -aat ate acg' ate age gag ctt ate 
Asp Tvr Gin Ala Gly Gly Glu Tvr Asn lie Thr He Ser Glu Leu lie 
\ 115 120 125 



144 



aaa gag cag cag etc ctg tta gat cat gag cgt ate gtt ttt cag. ttc 192 
Lys Glu Gin Gin Leu Leu Leu Asp His Glu Arg He Val Phe Gin Phe 
50 55 SO 



240 



gat gtg tea aca cat ggc tgg get tat gga act gga gga act aaa ttg 288 

Asp Val Leu Thr His Gly Trp Ala Tyr Gly Thr Gly Gly Thr Lys Leu . 

85 90 95 

cat gga aaa gaa eta etc tta get ate tec tea ggc gca cag gaa tct 336 

His Glv Lys Glu Leu Leu Leu Ala He Ser Ser Gly Ala Gin Glu Ser 

100 105 110 



384 



aga ccg ttt caa gtc act get aac tat ata gga atg cgt ttt ctt cct 432 
Arg Pro Phe Gin Val Thr Ala Asn Tyr He Gly Met Arg. Phe Leu Pro 
130 135 140 

gcg ttt aca caa tat ggg aca ctt. cat ctt tea aaa gaa gat gtt aag. 480 
Ala Phe Thr Gin Tyr Glv Thr Leu His Leu Ser Lys Glu Asp Val Lys 
145 150 155 160 

aac agt. gcg gag aga ttg. gtt gac tat ctt aaa gec gag cat taa 525 
Asn Ser Ala Glu Arg Leu Val Asp Tyr Leu Lys Ala Glu His 

165 170 175 

<210> 6 
<211> 175 
<212> PRT 

<213> Bacillus subtilis 
<400> 6 

Met Lys Thr Leu Val He Val He His Pro Asn Leu Glu Thr Ser Val 
1 5 ■ 10 15 
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Val hsn Lys Thr Trp Met Asn Arg Leu Lys Gin Glu Lys Asp lie Thr 
20 25 30 

Val His Astd Leu Tyr Glv Glu Tvr Pro Asn ?he lie lie Asp Val Glu 
35 40 45 

Lys Glu Gin Gin Leu Leu Leu Asp His Glu Arg He Val ?he Gin Phe 
50 55 60 

Pro Met Tvr Tro Tvr Ser Ser Pro Ala Leu Leu Lys Gin Trp Glu Asp 
65 " 70 75 80 

Asp Val Leu Thr His Glv Tro Ala Tyr Gly Thr Gly Gly Thr Lys Leu 
85 90 95 

His Gly Lvs Glu Leu Leu Leu Ala He Ser Ser Gly Ala Gin Glu Ser 
100 105 110 

Asp Tyr Gin Ala Gly Glv Glu Tvr Asn He Thr He Ser Glu Leu He 
115 120 125 

Arg Pro Phe Gin Val Thr Ala Asn Tyr He Gly Met Arg Phe Leu Pro 
130 135. 140 

Ala Phe Thr Gin Tyr Gly Thr Leu His Leu Ser Lys Glu Asp Val Lys 
145 150 155 160 

Asn Ser Ala Glu Arg Leu Val Asp Tyr Leu Lys Ala Glu His 

165 170 175 

<210> 7 
<211> 594 
<212> DNA 

<213> Bacillus subtilis 

<220> 

<221> CDS 

<222> (1) . . (594) 

<400> 7 ' ao 

atg gat cat atg aaa aca etc gta etc gtt gta cat ccgaat.ata gaa 48 
Met Aso His Met Lys Thr Leu Val Leu Val Val His Pro Asn lie Glu 
1 ". 5 10 15 

tec tct cert ate aat aaa aag tgg aaa gaa gec gtt tta agt gaa cca 
Ser Ser Arg He Asn Lvs Lys Trp Lys Glu Ala Val Leu Ser Glu Pro 
20 25 30 

gat gta act gtc cat gat ctt tat gaa aaa tat cgc gat caa cca att 144 
Asp Val Thr Val His Asp Leu Tyr Glu Lys Tyr Arg Asp Gin Pro He 
35 * 40 45 

gat gtg gaa ttt gaa caa cag cag etc • ctg gec cat gac cgt ate gtt 
Asp Val Glu Phe Glu Gin Gin Gin Leu Leu Ala His Asp Arg He Val 
50 55 60 

ttt cag ttt cca tta tac tgg tac age age cca ccg ctt tta aaa cag 
Phe Gin Phe Pro Leu Tyr Trp Tyr Ser Ser Pro Pro Leu Leu Lys Gin 
65 .70 75 80 

tgg ttt gat aaa gtg ttt acg ttt ggc tgg get cat ggt ccc ggc gga 
Tro Phe Asp Glu Val Phe Thr Phe .Glv Trp Ala His Gly Pro Gly Gly 
85 90 95 

aat aaa ttg aag ggg aaa gag tgg gta act gee atg tec ate ggt tea 
Asn Lys Leu Lys Gly Lys Glu Trp Val Thr Ala Met Ser He Gly Ser 



96 



192 



240 



288 



336 
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100. 105 110 

cct oaa cac tct tat caa gcc ggc gga tat aac ttg ttt teg aca age 

Pro Glu His Ser Tvr Gin Ala Glv Gly Tyr Asn Leu Phe Ser He Ser 

20 12 5 



115 



aaq ctg aca aaa ccg ttc caa gca tct gcc cat tta gta ggc atg acc 
GiS W Lvs Pro Phe Gin Ala Ser Ala Kis Leu Val Gly Met Thr 

130 ' 135 140 

tat ctg cct tec ttt gcc gaa tat cgc gcc aat aca ate agt gac caa 
Tyr L^u Pro Ser Phe Ala Glu Tyr Arg Ala Asn Thr He Ser Asp Gin 
145 150 155 160 

gaa att acc gaa agt gcg aat egg tat gta aag cat att aca aat ata 
Glu He tla Glu Ser Ala Asn Arg Tyr Val Lys His He Tnr Asn He 
165 170 175 

aaa tta aac ccg aag gtt cgc ctg caa agg tat ttg aaa cag ctg gag 
Glu Leu Asn Pro Lys Val Arg Leu Gin Arg Tyr Leu Lys Gin Leu Glu 
.180 185 190 

agt gtc gat tta aca taa 
Ser Val ksn Leu Thr 
195 

<210> 8 
<211> 198 
<212> PRT 

<213> Bacillus subtilis 

M-t°Asn His Met Lvs Thr Leu Val Leu Val Val His Pro Asn He Glu 
~1 \ '5 10 15 

Ser Ser Arg He Asn Lys Lys Trp Lys Glu Ala Val Leu Ser Glu Pro 

20 ... 25 30 

Asp Val Thr Val His Asp Leu Tyr Glu Lys Tyr Arg Asp Gin Pro He 
35 40 45 

Asp Val Glu Phe Glu Gin Gin Gin Leu Leu Ala His Asp - Arg He Val 
50 55 60 

Phe Gin Phe Pro Leu Tvr Trp Tyr Ser Ser Pro Pro Leu Leu Lys Gin 
65 70 75 30 

Trp Phe Aso Glu Val Phe Thr Phe Gly Trp Ala His Gly Pro Gly Gly 
85 90 95 

Asn Lvs Leu Lys Gly Lys Glu Trp Val Thr Ala Met Ser lie Gly Ser 
100 105 HO 

Pro Glu His Ser Tyr Gin Ala Gly Gly Tyr Asn Leu Phe Ser He Ser 
-115- 120 125 

Glu Leu Thr Lys Pro Phe Gin Ala Ser Ala His Leu Val Gly Met Thr 
130 * 155 140 

Tvr L~u Pro Ser Phe Ala Glu Tyr Arg Ala Asn Thr He Ser Asp Gin 
145 • 150 155 160 

Glu He Ala Glu Ser Ala Asn Arg- Tyr Val Lys His He Thr Asn He 
165 170 175 

Glu Leu Asn Pro Lys Val Arg Leu Gin Arg Tyr Leu Lys Gin Leu Glu 



384 



432 



430 



528 



576 



594 
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180 



135 



190 



Ser Val Asc Leu Thr 
155 



<210> 9 
<211> 630 
<212> DNA 

<213> Bacillus subtil is. 



c220> 

<221> CDS 

<222> (1) • • (630) 



gac gga tac gat aaa aaa aac ate gcg gaa acc ttc gga tta gat aaa 
Gly Gly Tvr Aso Lys Glu Asn He Ala Glu Thr Phe Gly Leu Asp Lys 
165 170 175 

aaa cat tat gta ccg gtt atg eta ctt tct ate gga' aaa gca gca gac 
I" Sm Val Pro Val Me! Leu Leu Ser He Gly Lys Ala Ala Asp 

180 185 I 90 

aaa ggc cat get tec tac cgt ctg ccg att gat aca att gca gaa tgg 
G*u Gly "Tvr Ala Ser Tvr Arg Leu. Pro He Asp Thr lie Ala Glu Trp 
195 200 205 



48 



96 



144 



192 



240 



atg ate aaa aca aac gat ttt atg- gaa att atg aaa ggc cgc cgt tct 
Mo? He Lvs Thr Asn Asp Phe Met Glu He Met Lys G±y Arg Arg Ser 
"l " 5 10 15 

ate cgc aac tat gat ccg gca gta aaa ate age aaa gaa gaa atg aca 
He Arg Asn Tyr Asp Pro Ala Val Lys He Ser i,ys Glu Glu Met Thr 
20 25 -° 

aaa ate tta gag gaa gca aca act gee cca tct tct gtt aac gcg cag 
Glu" He L-u Glu Glu Ala Thr Thr Ala Pro Ser Ser Val Asn Ala Gin 

• " i 35 40 45 . 

cca tag cgt ttt ctt gtc att gac age ccg gaa gga aaa gaa aag etc 
S r o TrU Sg Phe Leu Val He Asp Ser Pro Glu Gly Lys Glu Lys Leu 
50 • 55 60 

aca ccg ctt gca age ttt aac. caa aca caa gtc aca aca tea tct get 
Ala Pro Leu Ala Ser Phe Asn Gin Thr Gin Val Thr Thr Ser Ser Ala 
65 70 75 60 

ate ate get gta ttt gca gac ang aac aac gca gac tat eta gaa gaa 288 
Val He Ala Val Phe Ala Asp Met Asn Asn Ala Asp Tyr Leu Glu Glu 

85 90 95 . 

.ate tat tea aaa gee gtg gaa ctt ggt tac atg ccg cag gag gtc aaa 336 
lie Tyr Ser Lys Ala Val Glu Leu Gly Tyr Met Pro Gin Glu Val Ly=, 
100 1° 5 110 

aac aga caa ate gec gcg ctg acc gca cat ttt gaa aag ctt ccg gca 3 84 
ftp Arg Gin lie Ala All Leu Thr Ala His Phe Glu Lys Leu Pro Ala 
X15 120 125 

cag gtc aac cgt gaa acg ate ctg att gac gga ggt ctt gtt tec atg 
Gin Val Asn A?g Glu Th? lie Leu He Asp Gly Gly Leu Val Ser Met 
130 ~ 135 140 

cag ctg atg ctg act gca cgc gcg cat ggc tac gat aca aac ccg ate 480 
Gin Leu Mel Leu Thr Ala Arg Ala His Gly Tyr Asp Thr Asn Pro lie 
145 150 . 155 160 



432 



523 



576 



624 
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aaa taa D " ilj 
Lys 

210 

<210> 10 
<211> 210 
<212> PRT 

<213> Bacillus subtilis 
<400> 10 

Met He Lys Thr Asn Asd ?he Met Glu He Met Lys Gly Arg Arg Ser 
1 * 5 10 15 

He Arg Asn Tvr Asp Pro Ala Val Lvs He Ser Lys Glu Glu Met Thr . 
20 25 30 

Glu He Leu Glu Glu Ala Thr Thr Ala Pro Ser Ser Val Asn Ala Gin 
25 40 45 

Pro t\td Arg Phe Leu Val He Asp Ser Pro Glu Gly Lys Glu Lys Leu 
50 55 60 

Ala Pro Leu Ala Ser Phe Asn Gin Thr Gin Val Thr Thr Ser Ser Ala 
* 65 70 75 80 

Val lie Ala Val Phe Ala Asp Met Asn Asn Ala Asp Tyr Leu Glu Glu 
85 90 95 

He Tyr Ser Lvs Ala Val Glu Leu Gly Tyr Met Pro Gin Glu Val Lys 
100 105 HO 

Asp Arg Gin He Ala Ala Leu Thr Ala His Phe Glu Lys Leu Pro Ala 
115 120 125 

Gin Val Asn Arg Glu Thr He Leu He Asp Gly Gly Leu Val Ser Met 
130 135 140 

Gin Leu Met Leu Thr Ala Arg Ala His Gly Tyr Asp Thr Asn Pro He 
145 150 ~ 155 160 

Glv GP y Tyr Asp Lvs Glu Asn He Ala Glu Thr Phe Gly Leu Asp Lys 
" 165 170 175 

Glu Arg Tvr Val Pro Val Met Leu Leu Ser He Gly Lys Ala Ala Asp 
" 180 185 190 

Glu Gly Tyr Ala Ser Tyr Arg Leu Pro He Asp Thr He Ala Glu Trp 
195 200 205 

Lvs 

210 

<210> 11 
<211> 609 
<212> DNA 

<213> Bacillus subtilis 

<220> 
. <221> CDS 
<222> (1) . . (609) 

<400> 11 

atg acg aat act eta gat gtt tta aaa gca cgt gca tct-gta aag gaa 48 

Met Thr Asn Thr Leu Asp Val Leu Lys Ala Arg Ala Ser Val Lys Glu 

1 5 10 15 . 
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tat oat aca aat gcc ccg ate tct aag gag gag ctg act gag eta tta 96 

Tvr Aso Thr Asn Ala Pro lie Ser Lys Glu Glu Leu Thr Glu Leu Leu 
« - - __ -*i r- in 



20 25 30 

gac etc gcc act aaa gcg cct tct get tgg aac ctt cag cat tgg cat 

Asp Leu Ala Thr Lvs Ala Pro Ser Ala Trp Asn Leu Gin His Trp His 

35 40 45 



gcg tat aat caa aaa caa ate gtt gag tct tct get gtt gtt gcc att 

Ala Tyr Asn Gin Lys Gin He Val Glu Ser Ser Ala Val Val Ala He 

65 • 70 75 80 

tta ggc gat tta aag gca aat gaa aac ggt gaa gaa gtt tat get gaa 

Leu Glv *vsp Leu Lys Ala Asn Glu Asn Gly Glu Glu Val Tyr Ala Glu 

85 90 95 

tta gca age caa ggc tat att acg gat gaa ate aaa caa aca ttg etc 

Leu Ala Ser Gin Gly Tyr He Thr Asp Glu He Lys Gin Thr Leu Leu 
100 * 105 HO 

ggc caa ate aac ggt get tac caa age gag caa ttc gca cgt gat tec 

Gly Gin He Asn Gly Ala Tyr Gin Ser Glu Gin Phe Ala Arg Asp Ser 
115 120 125 

get ttc tta aat act tct tta get get atg cag ctt atg att gcc gca 

Ala Phe Leu Asn Ala Ser Leu Ala Ala Met Gin Leu Met He Ala Ala 
130 135' 140 



aaa gca aaa ggt tat gac act tgc gca ate- ggc gga ttt aac aaa gag 
Lys Ala Lvs Gly Tyr Asp Thr Cys Ala He Gly Gly Phe Asn .Lys Glu 
145 ■ 150 155 160 

cag ttc caa aag caa ttt gat ate agt gag cgc tat gtt ccg gtt atg 
Gin Phe Gin Lvs Gin Phe Asd He Ser Glu Arg. Tyr Val Pro Val Met 
165 170 175 

ctt att tea ate age aaa gca gtg aag cct gcg cat caa age aac cgt 
Leu He Ser He Glv Lys Ala Val Lvs Pro Ala His Gin Ser Asn Arg 
180 " 1S5 . 190 

ctg ccg ctt tea aaa gta tea act tgg ctg taa . 
Leu Pro Leu Ser Lys Val Ser Thr Trp Leu 
195 200 

<210> 12 
<211> 203 
<212> PRT 

<213> Bacillus subtilis ■ 

<400> 12 ' . 

Met Thr Asn Thr Leu Asp Val Leu Lys Ala Arg Ala Ser Val Lys Glu 
1 . 5 10 15- 

Tyr Asd Thr Asn Ala Pro He Ser- Lys Glu Glu Leu Thr Glu Leu Leu 
20 1 25 30 

Asp Leu Ala Thr Lvs Ala Pro' Ser Ala Trp Asn Leu Gin His Trp His 
35 40 45 

Phe Thr Val Phe His Ser Asp Glu Ser Lys Ala Glu Leu Leu Pro Val 
50 55 60 



144 



ttt aca eta ttc cac age gat gaa tea aaa gcg gag ctt ctt cct gta 192. 
^he Thr Val Phe His Ser Asp Glu Ser Lys Ala Glu Leu Leu Pro Val 
50 55 .60 



240 



288 



336 



384 



432 



480 



528 



576 



609 
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Pla Tvr Asn Gin Lvs Gin He Val Glu Ser Ser Ala Val Val Ala He 
" 6 5 " - " 70 75 80 

Leu Gly Asp Leu Lvs Ala Asn Glu Asn Gly Glu Glu Val Tyr Ala Glu 
85 90 95 

Leu ala Ser Gin Gly Tvr He Thr Asp Glu lie Lys Gin Thr Leu Leu 
100 105 HO 

Glv Gin lie Asn Gly Ala Tyr Gin Ser Glu Gin Phe Ala Arg Asp Ser 

120 12 5 



115 



Ala Phe Leu Asn Ala Ser Leu Ala Ala Met Gin Leu Met He Ala Ala 
130 135 140 

Lvs Ala Lys Gly Tyr Asp Thr Cys Ala He Gly Gly Phe Asn Lys Glu 
145 150 ' . 155 160 

Gin Phe Gin Lys Gin Phe Asp He Ser Glu Arg' Tyr Val Pro Val Met 
165 170 175 

Leu He Ser He Gly Lvs Ala Val Lys Pro Ala His Gin Ser Asn Arg 
180 185 190 

Leu Pro Leu Ser Lys Val Ser Thr Trp Leu 
195 200 . 

<210> 13 
<211> 555 
<212> DNA 

<213> Escherichia coli 

<220> 
<221* CDS 
<222> (1) (555) 

<400> 13 

atg atg tct cag cca gcg aaa gtt ttg ctg ctg tat gcc cat ccg gaa 

Met Met Ser Gin Pro Ala Lys Val Leu Leu Leu Tyr Ala His Pro Glu. 

1 5 10 15 

tct cag gac teg gtg gca aac egg gta ctg ctt aaa ccg gcc acg cag 
Ser Gin Asp " Ser Val Ala Asn Arg Val Leu Leu Lys Pro Ala Thr Gin 
20 25 30 

etc age aat gtt acc gtg cac gac ctt tac gcg cac tat ccc gat ttt 
Leu Ser Asn Val Thr Val His Asp Leu Tyr Ala His Tyr Pro Asp Phe 
35 40 45 

ttt att gat ate ccc cgt gag cag gca tta ctg cgc gag cac gag gtg 192 
Phe He Asp He Pro Arg Glu Gin Ala Leu Leu Arg Glu His Glu Val 
50 55 60 



att gtc ttt cag cat cct ctt tat acc tat age tgc ccg gcg eta ctg 

He Val Phe Gin His Pro Leu Tyr Thr Tyr Ser Cys Pro Ala Leu Leu 

65 70 75 eo 

aaa gag tgg ctg gac egg gta tta agt cgt ggt ttt .gcc age ggg ccg 

Lys Glu Trp Leu Asp • Arg Val Leu Ser Arg Gly Phe Ala Ser Gly Pro 

85 90 95 

gga gga aac caa' ctg gcg gga aag tac tgg. cgt age gtg att acc acc 

Gly Gly Asn Gin Leu Ala Gly Lvs Tyr Trp Arg Ser Val He Thr Thr 

100 105 110 

ggc gag ccg gaa agt get tac cgt tat gac gcg ctg aat cgc tac ccg 



48 



96 



144 



240 



288 



336 



384 
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Gly Glu Pro Glu Ser Ala Tyr Arg Tyr Asp Ala Leu Asn Arg Tyr Pre;. 



115 



atg age gat gtg ctg ego ccc ttt gaa ctg gcg gcg ggc atg tgc egg 
Met Ser Asp Val Leu Arg Pro Pne Glu Leu Ala Ala Gl : , Met Cj s Arg 
130 135 



sf . cat -co cta agt ccc ate att att tac tgg gcg aga egg caa age 
Met lis Tg Leu sir Pro lie lie He Tyr Trp Ala Arg Arg Gin Ser 
145 150 

nra caa aag ctq gcg age cac gec aga gec tac ggt gac . tgg ctggca 
lit III Til III 111 dr His Ala Arg Ala Tyr Gly Asp Trp Leu Axa 



165 

aat ccg ccg tct cca gga ggc cgc tga 
Asn Pro Leu Ser Pro Gly Gly Arg 

180 185 



<210> 14 
<211> 185 
<212> PRT 

<213> Escherichia coll 
<400> 14 



Me°t°Met 4 Ser Gin.: Pro Ala Lys Val Leu Leu Leu Tyr Ala His Pro Glu 



"i 5 10 



Ser Gin Asp Ser Val Ala ' Asn Arg Val Leu Leu Lys Pro Ala Thr Gin 



20 



Leu Ser Asn Val Thr Val His Asp Leu Tyr Ala His Tyr- Pro Asp Phe 



35 



40 



Phe He Asn lie Pro Arg Glu Gin Ala Leu Leu Arg Glu His Glu Val 

50 55 . 

lie Val Phe Gin His Pro Leu Tyr Thr Tyr Ser Cys Pro Ala Leu Leu 



65 70 . 75 



Lvs Glu Trp Leu Asp Arg Val Leu. Ser Arg Gly Phe Ala Ser Gly Pro 
85 90 

Gly Gly Asn Gin Leu Ala Gly Lys Tyr Trp Arg Ser Vai lie Thr Thr 
100 105 

Gly Glu Pro Glu Ser Ala Tyr Arg Tyr Asp Ala Leu Asn Arg Tyr Pro 

115 120 
Met Ser Asp Val Leu Arg Pro Phe Glu. Leu Ala Ala Gly Met Cys Arg 

.130 135 140 ... . 

Met His Trp Leu Ser Pro He He He Tyr Trp Ala Arg- Arg Gin Ser 



145 



150 155 



432 



480 



528 



555 



Ala Gin Glu Leu Ala Ser His Ala Arg Ala Tyr Gly Asp Trp Leu Ala 
165 170 

Asn Pro Leu Ser Pro Gly Gly Arg 

180 185 



<210> 15 
<211> 531 
<212> DNA 

<213> Escherichia coli 
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<220> 

<221> CDS 

<222> (1) . . (531) 

<400> 15 

atg att czz ata art tat gcg cat ccg tar ccg cat cat tec cat gcg 48 

Met lie Leu lie lie Tyr Ala His Pro Tyr Pro His His Ser His Ala 

15 10 15 

a at aaa egg atg ctt gaa cag gca agg acg ctg gaa ggc 'etc gaa att " 96 
Asn Lys Ara Met Leu Glu Gin Ala Arg Thr Leu Glu Gly Val Glu lie 
20 25 30 

cgc tct ctt tar caa etc tat cct gac ttc aat ate gat att gec gec 144 
Arg Ser Leu Tyr Gin Leu Tyr Pro Asp Phe Asn lie Ast> lie Ala Ala 
35 40 45 

gag cag gag gcg ctg tct cgc gec gat ctg ate gtc tgg cag cat ccg 192 
Glu Gin Glu Ala Leu Ser Arg Ala Asp Leu lie Val Trp Gin His Pro 
50 55 60 

atg cag tgg tac age att cct ccg etc etc aaa ctt tgg ate gat aaa 240 
Met Gin Trp Tyr Ser lie Pro Pro Leu Leu Lvs Leu Trr> lie Asd Lvs 
65 70 75 80 

gtt ttc tec cae ggc tgg get tac ggt cat ggc ggc acg gcg ctg cat 2 88 
Val Phe Ser His Gly Trp Ala Tyr Gly His Glv. Gly Thr Ala Leu His 
85 90 95 

ggc aaa cat ttg ctg tgg gcg gtg acg acc ggc ggc ggg gaa age cat 3 36 
Gly Lys His Leu Leu Trp Ala Val Thr Thr Gly Gly Gly Glu Ser His 
100 105 110 

ttt gaa att ggt gcg cat ccg ggc ttt gat gtg ctg teg cag ccg eta 3 84 
Phe Glu lie Gly Ala His Pro Gly Phe Asp Val Leu Ser Gin Pro Leu 
115 120 125 

cag gcg acg gca ate tac tgc ggg ctg aac tgg ctg cca ccg ttt gee 432 
Gin Ala Thr Ala lie Tyr Cys Glv Leu Asn Trp Leu Pro Pro Phe Ala 
130 135 140 

atg cac tgc acc ttt att tgt gac gac gaa acc etc gaa ggg cag gcg 480 
Met His Cys Thr Phe lie Cys Asp Asp Glu Thr Leu Glu Gly Gin Ala 
145 150 155 ' 160 

cgt cac tat aag caa cgt ctg ctg gaa tgg cag gag gee cat cat gga 528 
Arg His Tyr Lys Gin Arg Leu Leu Glu Tro Gin Glu Ala His His Glv 
165 170 175 

tag 531 



<210> 16 
<211> 177 
<212> PRT 

<213> Escherichia coli 
<400> 16 

Met lie Leu He He Tyr Ala His Pro Tyr Pro His His Ser His Ala 
15 10 15 

Asn Lys Arg Met Leu Glu Gin Ala Arg Thr Leu Glu Gly Val Glu He 
.20 25 30 

Arg Ser Leu Tyr Gin Leu Tyr Pro Asp Phe Asn He Asp lie Ala Ala 
35 40 45 
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Glu Gin Glu Ala Leu. Ser Arg Ala Asp Leu lie Val Trp Gin His Pro 
50 - 55 60 

Met Gin Tro Tyr Ser lie Pro Pro Leu Leu Lys Leu Trp He Asp Lys 
65 " 70 75 80 

Val ^he Ser Kis Gly Tro Ala Tyr Glv His Gly Gly Thr Ala Leu His 
35 90 95 

Gly Lvs His Leu Leu Trp Ala Val Thr Thr Gly Gly Gly Glu Ser His 
100 105 110 

Phe Glu He Gly Ala His Pro Glv Phe Asp Val Leu Ser Gin Pro Leu 
115 * 120 125 

Gin Ala Thr Ala He Tyr Cys Gly Leu Asn Trp Leu Pro Pro Phe Ala 
130 135 140 

Met His Cvs Thr Phe He Cys Asp Aso Glu Thr Leu Glu Gly Gin Ala 
145 150 155 160 

Arg His Tyr Lys Gin Arg Leu Leu Glu Trp Gin Glu Ala His. His 
• • 165 170 ■ 175 

Gly 

<210> 17 
<211> 222 
<212> PRT 

<213> Haemophilus influenzae . 
<400> 17 

Met Thr Gin Leu Thr Arg Glu Gin Val Leu Glu - Leu Phe His Gin Arg 
1 5. 10- 15 - 

Ser Ser Thr Arg Tyr Tyr Asp Pro Thr Lys Lys He Ser Asp Glu Asp 
20 25 30 

Phe Glu Cys He Leu Glu Cys Gly Arg Leu Ser Pro Ser Ser Val Gly 
35 40 45 

Ser Glu Pro Trp Lys Phe Leu Val He Gin Asn Lys Thr Leu -Arg Glu 
50 " *55 60 

Lvs Met Lys Pro Phe Ser Trp Gly . Met He Asn Gin Leu Asp Asn Cys 

■65 ' 70 -75 80 

Ser His Leu Val Val He Leu Ala Lys Lys Asn Ala Arg Tyr Asp Ser 
85 90 95 

Gin Gin Gin Ala Ala Leu Thr Lys Tyr Lys Ala Leu Gin Glu Glu Asp 
100 105 110 

Met Lys Leu Leu Glu Asn Asp Arg Thr Leu Phe Asp Trp Cys Ser Lys 
115 120 125 

Gin Thr Tyr He Ala Leu Ala Asn Met Leu Thr Gly Ala Ser Ala Leu 
130 135 140 

Gly He Ast> Ser Cvs Pro He Glu Gly Phe His Tyr Asp Lys Met Asn 
145 " 150 155 • 160 

Glu Cys Leu Ala Glu Glu Gly Leu Phe Asp Pro Gin Glu Tyr Ala Val 
165 170 175 

Lys Ser Arg Lys Gly Leu Asp Glu Val Val Lys Trp Val Gly 
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130 185 190 

<210> 18 
<211> 207 
<212>" PRT 

<213> Thermus aquaticus 
c4G0> 18 

Met Glu Pla. Thr Leu Pro Val Leu Asp Ala Lys Thr Ala Ala Leu Lys 
1 5 10 . 15 

Arq Arg Ser He Arg Arg Tyr Arg Lys Asp Pro Val Pro Glu Gly Leu 
20 25 30 

Leu Arg Glu He Leu. Glu Ala Ala Leu Arg Ala Pro Ser Ala Trp Asn 
35 .40 45 

L«u Gin Pro Tro Arg lie .Val Val Val Arg Asp Pro Ala Thr Lys Arg 
50 " 55. 60 

Ala Leu Ara Glu Ala Ala Phe Gly Gin Ala His Val Glu Glu Ala Pro 
t_ - 70 75 80 



65 



Val Val Leu Val Leu Tyr Ala Aso Leu Glu Asp Ala Leu Ala His Leu 
85 90 . .95 

Gin Lys Gin Ala He Gin Arg Ala Phe Ala Ala Met Gly Gin Glu Ala 
100 105 110 

Arg Lys Ala Trp Ala Ser Gly Gin Ser Tyr He Leu Leu Gly Tyr Leu 
115 120 125 

Leu Leu Leu' Leu Glu Ala Tyr Gly Leu Gly Ser Val Pro Met Leu Gly 
130 ' 135. . 140 

i>he Asp Pro Glu Arg Val Arg Ala He Leu Gly Leu Pro Ser Arg Ala 

I 4 5 150 ~ 155 160 

Ala lie Pro Ala Leu Val Ala Leu Gly Tyr Pro Ala Glu Glu Gly Tyr 
165 170 175 

^ro Ser His Arg Leu Pro Leu Glu Arg 0 Val Val Leu Trp Arg 
180 185 0 190 



<210> 19 

<211> 212 . . 
<212> PRT 

<213> Synechocystis PCC6803 

<400> 19 . 
Met Aso Thr Phe Asp Ala. He Tyr Gin Arg Arg Ser Val Lys His Phe 
1 5 .10 .15 

Asp Pro Aso His Arg Leu Thr Ala Glu Glu Glu Arg Lys Leu His Glu. 
20 25 30 

Ala Ala lie Gin Ala Pro Thr Ser Phe Asn lie Gin Leu Trp Arg Phe 
35 40 45 

Leu He He Arg Asd Pro Gin Leu Arg Gin Thr lie Arg Glu Lys Tyr 
50 55 60 

Glv Asn Gin Ala Gin Met Thr Asp Ala Ser Leu Leu He Leu Val- Ala. 
65 70 75 80 
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Ala Asp Val Asn Ala Trp Asp Lys Asd Pro Ala Arg Tyr Trp Arg Asn 

85 90 ■ ... 95 

Phe Tyr Gly Gly Lys Pro Gin Leu Gin Arg A.sp Glu Ala Gin Arg Ser 
100 105 110 

lie Gly Met Ala Met: Gin Asn Leu Met Leu Ala Ala Lys Ala Met Gly 
115 120 125 . 

Tvr Asp Ser Cvs Pro Met lie. Gly Phe Asd Leu Gin Lys Val. Ala Glu 

130 * 135 , * 140 

Leu Val Lys Leu Pro Ala Asp Tyr Ala lie Glv Pro Met Val Ala lie 
145 150 155 " 160 

Gly Lys Arg Thr Glu Asp Ala Pro Gly Lvs Arg Arg Ser Asri Ser Pro 
165 170 175 

Cys Leu Ala lie 
180 



<210> 20 
c211> 172 
<212> PRT ■ 

<213> Archaeoglobus fulgidus 
<400> 20 

Met Glu Cys Leu Asp Leu Leu Phe Arg Arg Val Ser lie Arg Lys Phe 

1 5 ■ * 10 15 

Thr Gin Asp Asp Val Asp Asp Glu lie Leu Met Lys lie. Leu Glu Ala 
20 25 30 

Gly Asn Ala Ala Pro Ser Ala Gly Asn Leu Gin Ala Arg Asp Phe Val 
35 40 45 

Val lie Arg Asn Pro Glu Thr Lys Lys Arg Leu Ala Met Ala Ala Leu 
50 55 - SO 

Lys Gin Met' Phe lie Ala Glu Ala Pro Val Val He Val Val Cys Ala. 
65 70 75 80- 

Asn Tyr Pro Arg Ser Met Arg Val Tyr Glv Glu Arg Gly Arg Leu Tyr 
85 90 95 

Ala Glu Gin Asp Ala Thr Ala Ala He Glu Asn He Leu Leu Ala Val 
100 105 110 

Thr Ala Leu Asn Leu Gly Ala Val Trp Val Gly Ala Phe Asp Glu Glu 
115 120 125 

Gin Val Ser ' Glu He Leu Glu Leu Pro Glu Tyr Val Arg Pro Met Ala 
130 135 140 

He He Pro He Glv His Pro Ala Glu Asn Pro Ser Pro Arg Asn Arg" 
145 " 150 155 160 

Tyr Pro Val Ser Met Leu Thr His Phe Glu Lys Trp 
165 170 



<210> 21 

<211> 174 

<212> PRT 

<213> Archaeoglobus 



fulgidus 
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<400> 21 

M*t Glu Glu Cys Leu Lys Met He Tyr Thr Arg Arg Ser He Arg Val 
"l 5 10 15 

Tvr Ser Asp Arg Gin He Ser Asp Glu Asp He Glu Lys He Leu Lys 
"20 25 30 

£Ta Ala Met Leu Ala Pro Ser Ala Gly Asn Glu Gin Pro Trp His ?he 
35 40 45 

He Val Val Arg Asp Arg Glu Met Leu Lys Lys Met Ser Glu Ala Phe 
50 55 60 

Thr Phe Gly Gin Met Leu Pro Asn Ala Ser Ala Ala He Val Val Cys 
65 70 75 .80 

Ala Aso Pro Lys Leu Ser Lys Tyr Pro Tyr Asp Met Trp Val Gin Asp 
85 90 95 

Cys s ~r Ala Ala Thr Glu Asn He Leu Leu Ala Ala Arg Cys Leu Gly 
.100 105 HO 

He Gly Ser Val Trp Leu Gly Val Tvr Pro Arg Glu Glu Arg Met Lys 
115 " 120 125 

Ala Leu Arg Glu Leu Leu Gly He Pro Glu Asn He Val Val Phe Ser 
130 ~ 135 • 140 

Val Val Ser Leu Gly Tvr Pro Lys Asp Glu Lys Asp Phe Tyr Glu Ala 
145 150 155 160 

Asp Asp Arg Phe Asn Pro Asp Arg He His Arg Glu Lys Trp 
165 170 

<210> 22 
<211> 606 
<212> DNA 

<213> Campylobacter jejuni 

<220> 
<221> CDS 
<222> (1) . . (606) 

<400> 22 

atg aaa aaa gaa ctt gaa att ttt age aca aga tat tct tgt aga aat 

Met Lys Lys Glu Leu Glu He Phe Ser Thr Arg Tyr Ser Cys Arg Asn 
1 5 10 15 

ttt aaa aat gaa aaa etc aaa aaa gag gat tta aat tct att tta gaa 96 
Phe Lys Asn Glu Lys Leu Lys Lys Glu Asp Leu Asn Ser He Leu Glu 
20 25 30 

ata gca aga tta age ccc agt tec ttg gga ctg gaa cct tag aaa ttt 14 4 
He Ala Arg Leu Ser" Pro Ser Ser Leu Gly Leu Glu Pro Trp Lys Phe 
35 40 45 

ata gta gtg caa gat gag aaa aga aaa gaa gaa ctt tct aaa att tgc 192 
He Val Val Gin Asp Glu Lvs Arg Lys Glu Glu Leu Ser Lys He Cys 
50 55' SO 



aat caa caa aaa cat gta aaa gat tgt get gca tta att ata ate att 
Asn Gin Gin Lys His Val Lvs Asp Cys Ala Ala Leu He He He He 
65 .70 75 80 

tea aga ctt gat ttt ttg gat tat ttt gaa gaa aaa ctt aga aaa aga 
Ser Arg Leu Asp Phe Leu Asp Tyr' Phe Glu Glu Lys Leu Arg Lys Arg 



48 



240 



288 
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35 90 ; 95 - ■ 



gat atg agt gaa aca gaa atg caa aaa cgc tea gat act tat atg cct 
*sp Met Ser Glu- Thr Glu Met Gin Lvs Arg Leu Asp Thr Tyr Met Pro 
100 105 110 



aat ata aca age tgc act ata ggt ggt ttt gat aaa gaa aag ctt gat 
Asn lie Ala Ser Cvs Thr lie Gly Gly Phe Asp Lys Glu Lys Leu Asp 
145 150 155 ISO 

tct tat tta tea ctt gat att caa aaa gaa aga tea agt ttg gtg gtg 
S~r Tyr Leu Ser Leu Asp lie Gin Lvs Glu Arg Ser Ser Leu Val Val 
165 170 175 

get tta aga tat tgc aac gat aaa aaa aat cct caa aaa aat cgt ttt 
Ala Leu Glv Tyr Cvs Asn Aso Lys Lys Asn Pro Gin Lys Asn Arg Phe 
180 ' 135 190 

agt ttt gat gaa gtt gta aaa ttt att taa 
Ser Phe Asp Glu Val Val Lys Phe lie 
195 200 

<210> 23 
<211> 202 
<212> PRT 

<213> Campylobacter jejuni 
. <400> 23 

Met Lvs Lys Glu Leu Glu He Phe Ser Thr Arg Tyr Ser Cys Arg Asn 
1 " 5 10 15 

Phe Lys Asn Glu Lys Leu Lys Lys Glu Asp Leu Asn Ser He. Leu Glu 
20 25- 30 

He Ala Arg Leu Ser Pro Ser Ser Leu Gly Leu Glu Pro Trp Lys Phe 
35 .40 45 

He Val Val Gin Asp Glu Lys Arg Lys Glu Glu Leu Ser Lys He Cys 
50 55 60 

Asn Gin Gin Lys His Val Lys Asp Cys Ala Ala Leu He He He He 
65 ' 70 75 80 

Ser Arg Leu Asp Phe Leu Asp Tyr Phe Glu Glu Lys Leu Arg Lys Arg 
85 . 90 95 

Asp Met Ser Glu Thr Glu Met Gin Lvs. Arg Leu Asp Thr Tyr Met Pro 
100 105 110 

Phe Leu Lys Ser Leu Asn Gin Glu Gin Lys He Ser Tyr Ala Arg Glu 
115 120 . .125 

Gin Ala His He Ala Leu Ala Ser He Leu Tyr Ser Ala Asn Ala Leu 
130 135 140 

Asn He Ala Ser Cys Thr He Gly Gly Phe Asp Lys Glu Lys. Leu Asp 
145 150 155 160 



33c 



ctt tta aaa tct eta aat caa gaa caa aaa ata tct tat gca aga gaa 334 

Phe Leu Lys Ser Leu A.sn Gin Glu Gin Lys He Ser Tyr Ala Arg Glu 
115 120 125 

caa get cat ata get eta get age ata ctt tac agt get aat get tta 432 

Gin Ala His He Ala Leu Ala Ser He Leu Tyr Ser Ala Asn Ala Leu 
130 135 140 



480 



528 



576 



606 
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xvr Leu Ser Leu Asp He Gin Lys Glu Arg Ser Ser Leu Val val 
165 ~ 170 175 

Ala Leu Glv Tvr Cys Asn Asp Lvs Lvs Asn Pro Gin Lys Asn Arg Phe 
180 185 190 

Ser Phe Asp Glu Val Val Lys Phe lie 
195 200 

<210> 24 
<211> 522 
<212> DMA 

<213> Porphyromonas gingival! s 

<220> 

<221> CDS 

<222> (1) . . (522) 



<400> 24 

atg aaa aaa acg etc gta ata gtc gtt cac ccc gat ttg acc aaa tec 

M«t Lvs Lvs Thr Leu Val He Val Val His Pro Asp Leu Thr Lys Ser 

1 ~ ~ 5 10 15 

gtt ate aac aag get tgg gee aaa gee ate gaa ggt gca gee act ate 

Val lie Asn Lvs Ala Trp Ala Lvs Ala He Glu Gly Ala Ala Thr He 

20 25 30 

cac cat etc tac gaa cag tat ccg aac gga caa ate gat eta gca cat 

His His Leu Tyr Glu Gin Tyr Pro Asn Gly Gin He Asp Leu Ala His 

35 ■ 40 45 



gtc ttt act gaa age tgg gec tat ggt gee ggt gga gac aag atg gag 
Val Phe Thr Glu Gly Trp Ala Tvr Gly Ala Gly Gly Asp Lys Met Glu 
85 90 95 

ggt aaa gaa ate tgt gca gca gtc tec tgc gga tea ccc- aaa tea get 
Gly Lys Glu lie Cys Ala Ala Val Ser Cys Gly Ser Pro Lys Ser Ala 
100 105 110 

ttt gec gaa gga gca cag caa tgc cac acg ctg cga age tac ttg aat 
Phe Ala Glu Gly Ala Gin Gin Cys His Thr Leu Arg Ser Tyr Leu Asn 
115 120 125 

gta ttc gac ggg ata act get ttc ctg cgc get cga ttc acc ggc tac 
Val Phe Asp Gly He Ala Ala Phe Leu Arg Ala Arg Phe Thr Gly Tyr 
130 135 140 

cat gec tgc tac gat tec tac aat cct cgc ctg ccg gaa atg ctg ccg 
His Ala Cvs Tyr Asp Ser Tyr Asn Pro Arg Leu Pro Glu Met Leu Pro 
145 ." 150 155 160 

gec aac tgc gaa gee tat etc cgc ttt ate aaa gga gaa tga 
Ala Asn Cvs Glu Ala Tyr Leu Arg Phe He Lys Gly Glu 
165 170 



<210> 25 
<211> 174 



48 



96 



144 



gaa caa gec ctg ctg gag get cat gac 'cgc ate gtc ttc caa ttc ccc 192 

Glu Gin Ala Leu Leu Glu Ala His Asp Arg He Val Phe Gin Phe Pro 

50 55 60 

etc tat tgg tat gca get ccc tat ctg ctg aag aag tgg atg gac gag 24 0 

Leu Tyr Tro Tyr Ala Ala Pro Tyr Leu Leu Lys Lys Trp Met Asp Glu 
65 70 75 . 80 



288 



336 



384 



432 



480 



522 
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<212> PRT 

<213> Porphyromonas gingival is 
<400> 25 

Met Lys Lys Thr Leu Val lie Val Val His Pro Asp Leu Thr Lys Ser 
1*5 10 15 

Val lie Asn Lvs Ala Tro Ala Lys Ala lie Glu Gly Ala Ala Thr lie 
20 25 30 

His His Leu Tvr Glu Gin Tyr Pro Asn Gly Gin lie Asp Leu Ala His 
35 - 40 45 

Glu Gin Ala Leu Leu Glu Ala His Asp Arg He Val Phe Gin Phe Pro 
50 55 60 

Leu Tyr Trp Tyr Ala Ala Pro Tyr Leu Leu Lys Lys Trp Met Asp Glu 
65 - 70 75 30 

Val Phe Thr Glu Gly Trp Ala Tyr Gly Ala Gly Gly Asp Lys Met Glu 
85 90 95 

Glv Lys Glu He Cys Ala Ala Val Ser Cys Gly Ser Pro Lys Ser Ala 
100 105 110 

Phe Ala Glu Gly Ala Gin Gin Cys His Thr Leu Arg Ser Tyr Leu Asn 
115 ~ 120 125 

Val Phe Asd Gly He Ala Ala Phe Leu Arg Ala Arg Phe Thr Gly Tyr 
130 " 135 140 

His Ala Cys Tyr Asp Ser Tyr Asn Pro Arg Leu Pro Glu Met Leu Pro 
145 * 150 155 160 

Ala Asn Cys Glu Ala Tyr Leu Arg Phe He Lys Gly Glu 
165 170 

<210> 26 
<211> 552 
<212> DNA 

<213> Yersinia pestis 

<220> 

<221> CDS 

<222> (1) . . (552) 

<400> 26 

atg atg ttg cag ccg ccg aag gtt ttg ctg ctg tat gcc cat ccg gaa 

Met Met Leu Gin Pro Pro Lys Val Leu Leu Leu Tyr Ala His Pro Glu 

1 5 10 .15 

tea cag gac teg gtc get aac egg gtt tta ctg caa ccg gta cag cag 
Ser Gin Aso Ser Val Ala Asn Arg Val Leu Leu Gin Pro Val Gin Gin 
20 25 30 



ttt att gat att cat cat gag cag caa ttg eta cgt gat cat caa gtt 

Phe He Asp He His His Glu Gin Gin Leu Leu Arg Asp His Gin Val 

50 55 60 

act gta ttt caa cat cct tta tat act tac agt tgc cct gca tta ctg 

lie Val Phe Gin His Pro Leu Tyr Thr Tyr Ser Cys Pro Ala Leu Leu 

55 70 75 80 



48 



96 



tta gaa cat gtc act gtg cac gat ctt tat gca cat tat ccg gat ttc 144 
Leu Glu His Val Thr Val His Asp Leu Tyr Ala His Tyr Pro Asp Phe 
35 40 45 



192 



240 
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aaa gag tag ttg gat egg qta ctg gca cgt ggt ttc gec aat age gtt 
Lys Glu Trp Leu Asp Arg Val Leu Ala Arg Glv Phe Ala Asn Gly Val 
es 90 95 

ggc age cat gca ctg acg gga aag cac tgg cgc teg grg att acc acc 
Gly Gly His Ala Leu Thr Gly Lys His Trp Arg Ser Val lie Thr Thx 
100 105 110 

ggt gag cag gag gga act tac cgt att ggg gga tat aac cgt tac cca 
Glv Glu Gin Glu Gly Thr Tyr Arg lie Gly Gly Tyr Asn Arg Tyr Pro 
115 120 125 

atg gaa aac att ctg cgt cct ttc gaa ttg acg gcg get atg tgc cat 
Met Glu Asd He Leu Arg Pro Phe Glu Leu Thr Ala Ala Met Cys His 
130 " 135 140 

atg cat tgg att aat ccg atg att att tac tgg gec aga cgc caa aag 
Met His Trp He Asn Pro Met He He Tyr Trp Ala Arg Arg Gin Lys 
145 ' 150 155 . 160 

ccg gaa aca etc gee agt cac gca caa get tat gtg caa tgg ctg cag 
Pro Glu Thr Leu Ala Ser His Ala Gin Ala Tyr Val Gin Trp Leu Gin 
165 . 170 175 

tea ccg etc acg aga gga etc tga 
Ser Pro Leu Thr Arg Gly Leu 
180 

<210> 27 
<211> 184 
<212> PRT 

<213> Yersinia pestis 
<400> 27 

Met Met Leu Gin Pro Pro Lys Val Leu Leu Leu Tyr Ala His Pro Glu 
15 10 15 

Ser Gin Asp Ser Val Ala Asn Arg Val Leu Leu Gin Pro Val Gin Gin 
20 25 30 

Leu Glu His Val Thr Val His Asp Leu Tyr Ala His Tyr Pro Asp Phe 
35 .40 45 

Phe He Asp He His His Glu Gin Gin Leu Leu Arg Asp His Gin Val 
50 55 60 

He Val Phe Gin His Pro Leu Tyr Thr Tyr Ser Cys Pro Ala Leu Leu 
65 70 75 90 

Lvs Glu Trp Leu Asp Arg Val Leu Ala Arg Gly Phe Ala Asn Gly Val 
85 90 95 

Glv Gly His Ala Leu Thr Gly Lys His Trp Arg Ser Val He Thr Thr 
100 105 110 

Gly Glu Gin Glu Gly Thr Tyr Arg He Gly Gly Tyr Asn Arg Tyr Pro 
115 120 125 

Met Glu Asp He Leu Arg Pro Phe Glu Leu Thr Ala Ala Met Cys His 
130 135 140 

Met His Trp He Asn Pro Met He He Tyr Trp Ala Arg Arg Gin Lys 
145 150 155 160 

Pro Glu Thr Leu Ala Ser His Ala Gin Ala Tyr Val Gin Trp Leu Gin 
165 170 175 



288 



336 



384 



432 



480 



528 



552 
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Ser Pro Leu Thr Arg Glv Leu 
180 



<210> 28 

<211> 633 

<212> DNA 

<213> Helicobacter pylori 



<400> 28 
atgaaatttt 


tggatcaaga 


aaaaagaaga 


caattgctaa 


acgagcgcca 


ttcttgcaag 


60 


atgttcgaca 


gccattatga 


gttttctagt 


gaagaattag 


aagaaatcgc 


tgaaatcgct 


120 


aggctatcgc 


caagctctta 


caacacgcag 


ccatggcatt 


ttgtgatggt 


tactaataag 


180 


gatttaaaaa 


aacaaattgc 


agcgcacagc 


tattttaatg 


aagaaatgat 


taaaagcgct 


240 


tcagcgttaa 


tggtggtatg 


ctctttaaaa 


cccagcgagt 


tgttacccac 


rggccactac 


300 


atgcaaaacc 


tttacccgga 


gtcttataag 


gttagagtga 


tcccctcttt 


tgctcaaatg 


360 


cttggcgtga 


gattcaacca 


cagcatgcaa 


aaattagaaa 


gctatatttt 


ggagcaacgc 


420 


tatatcgctg 


tggggcaaat 


ttgcatgggc 


gtgagcttaa 


tgggattgga 


tagttgcatt 


480 


attggaggct 


ttgatccttt 


aaaagtgggc 


gaagttttag 


aagagcgtat 


caataaacct 


540 


aaaatcgcac 


gcttgatcgc 


tttgggcaag 


agggtggcag 


aagcgagcca 


aaaarcaaga 


600 


aaatcaaaag 


ttgatgccat 


tacttggttg 


tga 






633 



<210> 29 
<211> 210 
<212> PRT 

<213> Helicobacter pylori 
<400> 29 

Met Lys Phe Leu Asp Gin Glu 

1 .5 

His Ser Cvs Lys Met. Phe Asp 
20 

Leu Glu Glu He Ala Glu He 
35 

Thr Gin Pro Trp His Phe Val 

50 55 

Gin He Ala Ala His Ser Tyr 
65 70 

Ser Ala Leu Met Val Val Cys 
85 

Thr Gly His Tyr Met Gin Asn 
100 

Val He Pro Ser Phe Ala Gin 
115 

Met Gin Lys Leu Glu Ser Tyr 
130 135 

Gly Gin lie Cys Met Gly Val 



Lys Arg Arg Gin Leu Leu Asn Glu Arg 
10 15 

Ser His Tyr Glu Phe Ser Ser Glu Glu. 
25 30 

Ala Arg Leu Ser Pro Ser Ser Tyr Asn 
40 45 

Met Val Thr Asn Lys Asp Leu Lys Lys 
60 

Phe Asn Glu Glu Met He Lys Ser Ala 
75 80 

Ser Leu Lys Pro Ser Glu Leu Leu Pro 
. 90 95 

Leu Tyr Pro Glu Ser Tyr Lys Val .Arg- 
105 " 110 

Met Leu Gly Val .Arg Phe Asn His Ser 
120 " 125 

He Leu Glu Gin Cys Tyr He Ala Val 
140 

Ser Leu Met Gly Leu Asp Ser Cys He 
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145 . 150 155 160 

lie Glv* Gly Phe Asp Pro Leu Lvs Val Gly Glu Val Leu Glu Glu Arg 
155 170 175 

lie Asn Lys Pro Lys lie Ala Cvs Leu lie Ala Leu Gly Lys Arg Val 
180 * 135 190 

Ala Glu Ala'Ser Gin Lys Ser.Arg Lys Ser Lys Val Asp Ala He Thr 
195 200 205 

Trp Leu 
210 



SUBSTITUTE SHEET (RULE 26) 



INTERNATIONAL SEARCH REPORT 



Intei onal Application No 

PCT/G8 00/00431 



A. CLASSIFICATION OF SUBJECT MATTER 

IPC 7 C12N9/02 C12N15/52 A61K35/00 



According to International Patent Classification (IPC) or 10 both national classification and IPC 



B. FIELDS SEARCHED 



Minimum documentation searched (classification system followed by classification symbols) 

IPC 7 C12N 



Documentation searched otner than minimum documentation to the extent that such documents are included in the fields searched 



Electronic data base consulted during the international search (name ol data base and, where practical, search terms used) 



WPI Data, EPO-Internal , STRAND, BIOSIS 



C. DOCUMENTS CONSIDERED TO BE RELEVANT 



Category * Citation of document, with indication, where appropriate, ol the relevant passages 



Relevant to claim No. 



EP 0 540 263 A (CANCER RES CAMPAIGN TECH) 
5 May 1993 (1993-05-05) 
cited in the application 
the whole document 

WO 95 12678 A (CONNORS THOMAS ;KN0X 

RICHARD (GB); SHERWOOD ROGER (GB); CANCER 

RES) 11 May 1995 (1995-05-11) 

the whole document 

especially figure 6, examples 1-4 

DE 42 21 830 A ( BIOTECHNOLOG F0RSCHUNG 
GMBH) 28 January 1993 (1993-01-28) 
the whole document 



1-3,5-28 



1-3,5-28 



1-3,5-28 



| X | Furthsr documents are listed in Ihe continuation of box C 



ID 



Patent family members are listed in annex. 



3 Special categories of cited documents : 

"A" document defining the general state of the art which is not 

considered to be of particular relevance 
"E" earlier document but published on or after the International 

filing date 

"L* document which may throw doubts on priority ciaim(s) or 
which is cited to establish the publication date ol another 
citation or other special reason (as specified) 

O" document referring to an oral disclosure, use, exhibition or 
other means 

"P" document published prior tc the international tiling date but 
later than the pnority date claimed 



later document published after the international filing date 
or priority date and not "m conflict with the application but 
cited to understand the principle or theory underlying the 
invention 

' document ol particular relevance; the claimed invention 
cannot be considered novel cr cannot be considered to 
involve an inventive step when the document is taken alone 

' document of particular relevance; the claimed invention 
cannot be considered to involve an inventive step when the 
document is combined with one or more other such docu- 
ments, such combination being obvious to a person skilled 
in the art. 

' document member of the same parent family 



Date of thg actual completion ol the international search 



13 July 2000 



Date of mailing of the international search report 



25/07/2000 



Name and mailing address of the ISA 

European Patent Office. P.B. 5618 Patentlaan 2 
ML - 2280 HV Rijswijk 
Tel. (-31-70) 340-2040. Tx. 31 651 epo nl, 
Fax: (+31-70) 340-3016 



Authorized officer 



Panzica, G 



Form PCT/ISA/21 0 (second sheet) i July 1992) 



page 1 of 2 



INTERNATIONAL SEARCH REPORT 



intei onal Application No 

PCT/GB 00/00431 



C. (Continuation) DOCUMENTS CONSIDERED TO BE RELEVANT 



Category *' Citation of document, with indicatlon.where appropriate, o( ths relevant pa&sag&s 



Relevant to claim No. 



EP 0 547 876 A (CHISSO CORP) 
23 June 1993 (1993-06-23) 
abstract 

claim 4; figure 4 

ANTELMANN H. ET AL . : "First step from a 
two-dimensional protein index towards a 
response-regulation map for Bacillus 
subtil is" 
ELECTROPHORESIS, 

vol. 18, no. 8, 1997, pages 1451-1463, 

XP000923464 

the whole document 

WO 98 57662 A (BURKE PHILIP JOHN ;ENZACTA 
R&D LTD (GB); KNOX RICHARD JOHN (GB) ) 
23 December 1998 (1993-12-23) 
abstract 
figure 6; example 1 



1-3,5-28 



1-3,5-28 



1-3,5-28 



Form PCT/1SA/210 t continuation oi second sheet) (Jury 1092) 



page 2 of 2 



INTERNATIONAL SEARCH REPORT 

Information on patent family members 



Intct onai Application No 

PCT/GB 00/00431 



Patent document 
cited in search report 



Publication 
date 



Patent family 
member(s) 



Publication 
date 



AU 


681337 


B 


28-08-1997 


AU 


2776992 


A 


21-05-1993 


AU 


3521597 


A 


19-02-1998 


CA 


2122036 


A 


29-04-1993 


EP 


0638123 


A 


15-02-1995 


WO 


9308288 


A 


29-04-1993 


JP 


7501692 


T 


23-02-1995 


US 


5977065 


A 


02-11-1999 


us 


5633158 


A 


27-05-1997 


us 


5780585 


A 


14-07-1998 



EP 0540263 



05-05-1993 



WO 9512673 A 11-05-1995 AU 690935 B 07-05-1998 

AU 8065794 A 23-05-1995 

CA 2175687 A 11-05-1995 

EP 0725826 A 14-08-1996 

JP 9505037 T 20-05-1997 

NZ 275147 A 24-11-1997 

US 5958682 A 23-09-1999 



DE 


4221830 


A 


28-01-1993 


NONE 








EP 


0547876 


A 


23-06-1993 


JP 


2090512 


C 


18-09-1996 










JP 


6007171 


A 


18-01-1994 










JP 


8000075 


B 


10-01-1996 










US 


5468631 


A 


21-11-1995 


WO 


9857662 


A 


23-12-1998 


EP 
GB 


0988059 
2341605 


A 
A 


29-03-2000 
22-03-2000 



Form PCT/ISA/St 0 (patent tamBy annex) (July 1992) 



